Straight from the Book 


Titu Andreescu 
Gabriel Dospinescu 


Straight from the Book 


Press 


Titu Andreescu Gabriel Dospinescu 
University of Texas at Dallas Ecole Normale Supérieure, Lyon 


Library of Congress Control Number: 2012951362 
ISBN-13: 978-0-9799269-3-8 ISBN-10: 0-9799269-3-9 
© 2012 XYZ Press, LLC 


All rights reserved. This work may not be translated or copied in whole or in 
part without the written permission of the publisher (XYZ Press, LLC, 3425 
Neiman Rd., Plano, TX 75025, USA) and the authors except for brief excerpts 
in connection with reviews or scholarly analysis. Use in connection with any 
form of information storage and retrieval, electronic adaptation, computer 
software, or by similar or dissimilar methodology now known or hereafter 
developed is forbidden. The use in this publication of tradenames, trademarks, 
service marks and similar terms, even if they are not identified as such, is not 
to be taken as an expression of opinion as to whether or not they are subject 
to proprietary rights. 


987654321 
www.awesomemath.org 


Cover design by Iury Ulzutuev 


The only way to learn mathematics is te do mathematics. 


-Paul Halmos 


Foreword 


This book is a follow-on from the authors’ earlier book, ‘Problems from 
the Book’. However, it can certainly be read as a stand-alone book: it is not 
vital to have read the earlier book. 

The previous book was based around a collection of problems. In contrast, 
this book is based around a collection of solutions. These are solutions to 
some of the (often extremely challenging) problems from the earlier book. 
The topics chosen reflect those from the first twelve chapters of the previous 
book: so we have Cauchy-Schwarz, Algebraic Number Theory, Formal Series, 
Lagrange Interpolation, to name but a few. 

The book is one of the most remarkable mathematical texts I have ever 
seen. First of all, there is the richness of the problems, and the huge variety 
of solutions. The authors try to give several solutions to each problem, and 
moreover give insight about why each proof is the way it is, in what way the 
solutions differ from each other, and so on. The amount of work that has been 
put in, to compile and interrelate these solutions, is simply staggering. There 
is enough here to keep any devotee of problems going for years and years. 

Secondly, the book is far more than a collection of solutions. The solu- 
tions are used as motivation for the introduction of some very clear expositions 
of mathematics. And this is modern, current, up-to-the-minute mathematics. 
For example, a discussion of Extremal Graph Theory leads to the celebrated 
Szemerédi-Trotter theorem on crossing numbers, and to the amazing applica- 
tions of this by Székely and then on to the very recent sum-product estimates 
of Elekes, Bourgain, Katz, Tao and others. This is absolutely state-of-the-art 
material. It is presented very clearly: in fact, it is probably the best exposition 
of this that I have seen in print. 
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As another example, the Cauchy-Schwarz section leads on to the devel- 
opements of sieve theory, like the Large Sieve of Linnik and the Turdn-Kubilius 
inequality. And again, everything is incredibly clearly presented. The same 
applies to the very large sections on Algebraic Number Theory, on p-adic 
Analysis, and many others. 

It is quite remarkable that the authors even know so much current math- 
ematics — I do not think any of my colleagues would be so well-informed over 
so wide an area. It is also remarkable that, at least in the areas in which I 
am competent to judge, their explanations of these topics are polished and 
exceptionally well thought-out; they give just the right words to help someone 
understand what is going on. 

Overall, this seems to me like an ‘instant classic’. There is so much 
material, of such a high quality, wherever one turns. Indeed, if one opens the 
book at random (as I have done several times), one is pulled in immediately 
by the lovely exposition. Everyone who loves mathematics and mathematical 
thinking should acquire this book. 


Imre Leader | 
Professor of Pure Mathematics 
University of Cambridge 


Preface 


This book is a compilation of many suggestions, much advice, an even 
more hard work. Its main objective is to provide solutions to the problems 
which were originally proposed in the first 12 chapters of Problems from the 
Book, We were not able to provide full solutions in our first volume due to the 
lack of space. In addition, the statements of the proposed problems contained - 
typos and some elementary mistakes which needed further editing. Finally, 
the problems were also considered to be quite difficult to tackle. With these 
points in mind, we came up with a two-part plan: to correct the identified 
errors and to publish comprehensive solutions to the problems. 

The first task, editing the statements of the proposed problems, was sim- 
ple and has already been completed in the second edition of Problems from the 
Book. Although we focused on changing several problems, we also introduced 
many new ones. The second task, providing full solutions, however, proved to 
be more challenging than expected, so we asked for help. We created a forum 
on www.mathlinks.ro (a familiar site for problem-solving enthusiasts) where 
solutions to the proposed problems were gathered. It was a great pleasure to 
witness the passion with which some of the best problem-solvers on mathlinks 
attacked these tough-nuts. This new book is the result of their common effort, 
and we thank them. 

Providing solutions to every problem within the limited space of one vol- 
ume turned out to be an optimistic plan. Only the solutions to problems 
from the first 12 chapters of the second edition of Problems from the Book are 
presented here. Furthermore, many of the problems are difficult and require 
a rather extensive mathematical background. We decided, therefore, to com- 
plement the problems and solutions with a series of addenda, using various 
problems as starting points for excursions into real mathematics”. Although 


we never underestimated the role of problem solving, we strongly believe that 
the reader will benefit more from embarking on a mathematical journey rather 
than navigating a huge list of scattered problems, This book tries to reconcile 
problem-solving with ” professional mathematics” . 

Let us now delve into the structure of the book which consists of 12 chap- 
ters and presents the problems and solutions proposed in the corresponding 
chapters of Problems from the Book, second edition. Many of these problems 
are fairly difficult and different approaches are presented for a majority of 
them. At the end of each chapter, we acknowledge those who provided so- 
lutions. Some chapters are followed by one or two addenda, which present 
topics of more advanced mathematics stemming from the elementary topics 
discussed in the problems of that chapter. 

The first two chapters focus on elementary algebraic inequalities (a no- 
table caveat is that some of the problems are quite challenging) and there is 
not much to comment regarding these chapters except for the fact that al- 
gebraic inequalities have proven fairly popular at, mathematics competitions. 
To provide relief from this rather dry landscape (the reader will notice that 
most of the problems in these two chapters start with "Let a,6,¢ be positive 
real numbers”), we included an addendum presenting deep applications of the 
Cauchy-Schwarz inequality in analytic number theory. For instance, we dis- 
cuss Gallagher’s sieve, Linnik’s large sieve and its version due to Montgomery. 
We apply these results to the distribution of prime numbers (for instance 
Brun’s famous theorem stating that the sum of the inverses of the twin primes 
converges). The note-worthy Turdn-Kubilius inequality and its classical appli- 
cations (the Hardy-Ramanujan theorem on the distribution of prime factors of 
n, Erdés’ multiplication table problem, Wirsing’s generalization of the prime 
number theorem) are also discussed. The reader will be exposed to the power 
of the Cauchy-Schwarz inequality in real mathematics and, hopefully, will un- 
derstand that the gymnastics of three-variables algebraic inequalities is not 
the Holy Grail. 

Chapter 3 discusses problems related to the unique factorization of inte- 
gers and the p-adic valuation maps. Among the topics discussed, we cover 
the local-global principle (which is extremely helpful in proving divisibilities 
or arithmetic identities), Legendre’s formula giving the p-adic valuation of n!, 
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a beneficial elementary result called lifting the exponent lemma, as well as 
more advanced techniques from p-adic analysis, One of the most beautiful re- 
sults presented in this chapter in the celebrated Skolem-Mahler-Lech theorem 
concerning the zeros of a linearly recursive sequence. Readers will perhaps 
appreciate the applications of p-adic analysis, which is covered extensively in 
a long addendum. This addendum discusses, from a foundational level, the 
arithmetic of p-adic numbers, a subject. that plays a central role in modern 
number theory. Once the basic groundwork has been laid, we discuss the 
p-adic analogues of classical functions (exponential, logarithm, Gamma func- 
tion) and their applications to difficult congruences (for instance, Kazandzidis’ 
famous supercongruence). This serves as a good opportunity to explore the 
arithmetic of Bernoulli numbers, Volkenborn’s theory of p-adic integration, 
Mahler and Amice’s classical theorems characterizing continuous and locally 
analytic functions on the ring of p-adic integers or Morita’s construction of the 
p-adic Gamma function. A second addendum to this chapter discusses various 
classical estimates on prime numbers, which are used throughout the book. 


Chapter 4 discusses problems and elementary topics related to prime num- 
bers of the form 4k + 1 and 4k + 3. The most intriguing problem discussed 
is, without any doubt, Cohn’s renowned theorem characterizing the perfect 
squares in the Lucas sequence. Chapter 5 is dedicated again to the yoga of 
algebraic inequalities and is followed by an addendum discussing applications 
of Hélder’s inequality. 

Chapter 6 focuses on extremal graph theory. Most of the problems revolve 
around Turan’s theorem, however the reader will also be exposed to topics 
such as chromatic numbers, bipartite graphs, etc. This chapter is followed 
by a relatively advanced addendum, discussing various topics related to the 
Szemerédi-Trotter theorem, which gives bounds for the number of incidences 
between a set of points and a set of curves. We discuss the theorem’s classical 
probabilistic proof, its generalization to multi-graphs due to Székely and its 
application to the sum-product problem due to Elekes, as well as more recent, 
developments due to Bourgain, Katz, and Tao. These results are then applied 
to natural and nontrivial geometric questions (for instance: what is the least 
number of distinct distances determined by n points in the plane? what is the 
maximal number of triangles of the same area?}. Finally, another addendum 
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completes this chapter and is dedicated to the powerful probabilistic method. 
After a short discussion on finite probability spaces, we provide many examples 
of combinatorial applications. 

Chapter 7 involves combinatorial and number theoretic applications of 
finite Fourier analysis. The central principles of this chapter include the roots 
of unity and the fact that congruences between integers can be expressed in 
terms of sums of powers of roots of unity. To provide the reader with a broader 
view, we beefed-up this chapter with an addendum discussing Fourier analysis 
on finite abelian groups and applying it to Gauss sums of Dirichlet characters, 
additive problems, combinatorial or analytic number theory. For instance, 
the reader will find a discussion of the Pélya-Vinogradov inequality and Vino- 
gradov’s beautiful use of this inequality to deduce a rather strong bound on 
the least quadratic non-residue mod p. At the same time, we explore Dirich- 
let’s L-functions, culminating in a proof of Dirichlet’s theorem on arithmetic 
progressions. This section is structured to first present the usual analytic proof 
(since it is really a masterpiece from all points of view), up to some important 
simplification due to Monsky. At the same time, we also discuss how to turn 
this into an elementary proof that avoids complex analysis and dramatically 
uses Abel’s summation formula. 

Chapter 8 focuses on diverse applications of generating functions. This 
is an absolutely crucial tool in combinatorics, be it additive or enumerative. 
In this section, the reader will have the opportunity to explore enumerative 
problems (Catalan’s problem, counting the number of solutions of linear dio- 
phantine equations or the number of irreducible polynomials mod p, of fixed 
degree). We also discuss exotic combinatorial identities or recursive sequences, 
which can be solved elegantly using generating functions, but also rather chal- 
lenging congruences that appear so often in number theory. The chapter is 
followed by an addendum presenting a very classical topic in enumerative 
combinatorics, Lagrange’s inversion formula. Aniong the applications, let us 
mention Abel’s identity, other derived combinatorial identities, Cayley’s the- 
orem on labeled trees, and various related problems. 

Chapter 9 is rather extensive, due to the vast nature of the topic covered, 
algebraic number theory. While we are able only to scratch the surface, nev- 
ertheless the reader will find a variety of intriguing techniques and ideas. For 
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instance, we discuss arithmetic properties of cyclotomic polynomials (inchud- 
ing Mann’s beautiful theorem on linear equations in roots of unity), rationality 
problems, and various applications of the theorem of symmetric polynomials. 
In addition, we present techniques rooted in the theory of ideals in number 
fields, finite fields and p-adic methods. We also give an overview of the el- 
ementary algebraic number theory in the addendum following this chapter. 
After a brief review on ideals, field extensions, and algebraic numbers, we 
proceed with a discussion of the primitive element theorem and embeddings 
of number fields into C. We also briefly survey Galois theory, and the fun- 
damental theorem on the prime factorization of ideals in number fields, due 
to Kummer and Dedekind. Once the foundation has been set, we discuss the 
prime factorization in quadratic and cyclotomic fields and apply these tech- 
niques to basic problems that explore the aforementioned theories. Finally, we 
discuss various applications of Bauer’s theorem and of Chebotarev’s theorem. 
The next addendum is concerned with the fascinating topic of counting the 
number of solutions modulo p of systems of polynomial equations. We use 
this as an opportunity to state and prove the basic structural results on finite 
fields and introduce the Gauss and Jacobi sums. We go ahead and count the 
number of points over a finite field of a diagonal hypersurface and to compute 
its zeta function. This is a beautiful theorem of Weil, the very tip of a massive 
iceberg. 

Chapter 10 focuses on the arithmetic of polynomials with integer coeffi- 
cients. An essential aspect of the discussion concentrates on Mahler expan- 
sions, the theory of finite differences, and their applications. The techniques 
used in this chapter are rather diverse. Although the problems can be consid- 
ered basic, they are challenging and require advanced problem-solving skills. 

Chapter 11 provides respite from the difficult tasks mentioned above. It 
discusses Lagrange’s interpolation formula, allowing a unified presentation of 
various estimates on polynomials. 

The longest and certainly most challenging chapter is the last one. It 
explores several algebraic techniques in combinatorics. The methods are stan- 
dard but powerful. The last part of the chapter deals with applications to 
geometry, presenting some of Dehn’s wonderful ideas. The last problem pre- 
sented in the book is the farnous Freiling, Laczkovich, Rinne, Szekeres theorem, 
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a stunningly beautiful application of algebraic combinatorics. 

We would like to thank, again, the members of the mathlinks site for 
their invaluable contribution in providing solutions to many of the problems 
in this book. Special thanks are due to Richard Steng, who did a remarkable 
job by pointing out many inaccuracies and suggesting numerous alternative 
solutions. We would also like to thank Joshua Nichols-Barrer, Kathy Cordeiro 
and Radu Sorici, who gave the manuscript a readable form and corrected 
several infelicities. Many of the problems and results in this book were used by 
the authors in courses at the AwesomeMath Summer Program, and students’ 
reactions guided us in the process of simplifying or adding more details to the 
discussed problems. We wish to thank them all, for their courage in taking 
and sticking with these courses, as well as for their valuable suggestions. 


Titu Andreescu Gabriel Dospinescu 
titu.andreescu@utdallas.edu gdospi2002@yaheo.com 


Contents 


1 Some Useful Substitutions i 
Li The relation a? +6? 4c? =abe+4....0,....0.00.. 2 
1.2 The relations abe = a+6+¢e+2and ab+be+ca+2abe=1. 9 
1.3 The relation a? +6? +c? 4+2abe=1 2.00.00... .000, 21 
LA. Notes xe. fe eee 2G i Ben ed i Se cee he Be 27 

2 Always Cauchy-Schwarz... 29 
BA. NOES: eee Soe ee RR ER we Se ed 61 
Addendum 2.4 Cauchy-Schwarz in Number Theory ......... 62 

3 Look at the Exponent 91 
Bil <Teitroduction:s 5-2.304. ¢anneats- A WiAsa ae thd a lewb ere Geese as 9t 
3.2 Local-global principle 92 
3.3 Legendre’s formula... 2.0... ee ee 96 
3.4 Problems with combinatorial and valuation-theoretic aspects . 104 
3.5 Lifting exponent lemma .. 2... ..--......-..00.. 110 
3.6 p-adic techniques... ee 116 
3.7 Miscellaneous problems . 2.2.2.0... 0000002 eee 125 
SiS: | NOES so oe nis hot a Gen bese vate oe, Ge aoeneare ee bak Sk 134 
Addendum 3.A Classical Estimates on Prime Numbers ..... . 135 
Addendum 3.B An Introduction to p-adic Numbers .,....... 141 

4 Primes and Squares 189 


ALL INGURR:: 2.05 a. 1. 9) SNe thee Gace ® Gute 3 Ske a ah es Se ek et 203 


xvi Contents Contents xvii 


5 To’s Lemma 205 106 Arithmetic Properties of Polynomials 485 
Bak nINOGES 45.2 ee. Se tSy nats fic, we than eda ha dct, Ca, at 296 10.1 The a—b|f{a)— f(b) trick 2. ee en 485 
Addendum §.A Hélder’s Inequality in Action... 0.00.00... 227 10.2 Derivatives and p-adic Taylor expansions... 2... ...-.. 494 

10.3 Hilbert polynomials and Mahler expansions ........... A497 

6 Some Classical Problems in Extremal Graph Theory 235 10.4 p-adic estimates. ee ee 506 
Of Notes: 52 Gol a tae ok Beating do a4 py aes 10.5 Miscellaneous problems ........0..0.0..-00000 uuu 513 
Addendum 6.A Some Pearls of Extremal Graph Theory z LOG NOGA” ana Wahid fe ae eso eerste aio Sk fis ROO 520 
Addendum 6.B Probabilities in Combinatorics j 

11 Lagrange Interpolation Formula 521 

7 Complex Combinatorics TLE, Notes isc s ecielg 255 gue! se an edhe tlhe So Qader an Sot Sees 537 
7.1 Tiling and coloring problems ..........0.......00- 

7.2 Counting problems... 00.0.0... 12 Higher Algebra in Combinatorics 539 
7.3 Miscellaneous problems .................00-04 12.1 The determinant trick .......0.020..2.0 2000-00004 541 
Hid NOUed coo Bo hc She Rialog ca at as oe cok ake nels 2 12.2 Matrices over Fp 2... 00.0020. 546 
Addendum 7.A Finite Fourier Analysis ©... 0. 00 00 ee ee 12.3 Applications of bilinear algebra 552 

12.4 Matrix equations .. 0... 0... ee 561 

8 Formal Series Revisited 337 12.5 The linear independence trick... 2.2... 0 ee eee 568 
8.1 Counting problems... 02. ee 339 12.6 Applications to geometry . 2.2... 576 
8.2 Proving identities using generating functions ........ 4... 349 ADE NOES 25.5! 4 bie eels PREC SLAY Le kee he 583 
8.3 Recurrence relations . 6... 0. 354 
8.4 Additive properties... 2.2... ee. 861 Bibliography 585 
8.5 Miscellaneous problems 2.00.00 0. ee 371 
8:6: Notes 2.8 P he oc ns ence tee tye nWenahe eaten & Aloe ee Sha 375 
Addendum 8.A Lagrange’s Inversion Theorem ............ 377 

9 A Little Introduction to Algebraic Number Theory 395 
9.1 Tools from linear algebra 2. 396 
9:2. Cy lowomny sé 4 syle eaiac he a A caea ae dig, Pun ind an sete tes 402 
913° The: ged’ tricks” 2-5-3 5.2 Sct, fF Apa eS odewelgs oasis a 408 
94 The theorem of symmetric polynomials... . 0... . 410 
9.5 Ideal theory and local methods ... 2.2.0.0... ... 420 
9.6 Miscellaneous problems ................. 00004 426 
Oe Notes fo ep ane el eh oe ph died. hia Sod Lak eae 433 
Addendum 9,A Equations over Finite Fields 2.0 .0.00..000004 434 


Addendum 9.B A Glimpse of Algebraic Number Theory ..... . 456 


Chapter 1 


Some Useful Substitutions 


Let us first recall the classical substitutions that will be used in the fol- 
lowing problems. All of these are discussed in detail in [3], chapter 1 and the 
reader is invited to take a closer look there. 

Consider three positive real numbers a, b,c. If abe = 1, a classical substi- 
tution is 


x£ q z 
om, beh ee = 
y Zz 
A less classical one is 
£ y 28 
YY beZ? zen’ ° aty 


(for some positive real numbers 2, y, 2) whenever ab + be + ca + 2abe = 1, or 
its equivalent version 
yYt2z Z+z E+y 
a= c= 


| adi Roce 
a? y z 


when abc = a+6+¢ + 2 (the equivalence between the two relations follows 
from the substitution (a, b,c) + G, is 4)). Two other very useful substitutions 
concern the relations a? + 6? +¢? = abe +4 and a? +8? + ¢? + 2abe = 1. In the 
first. case, with the extra assumption max(a,6,c) > 2,,one can find positive 
numbers x,y,z such that xyz = 1 and 


i 
Q=Et—, py! 


1 
Ep R, 
z y 2 
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In the second case one can find an acute-angled triangle ABC such that 
a= cos(A), b= cos(B), c= cos(C). 


Of course, in practice one often needs to use a mixture of these substitutions 
and to be rather familiar with classical identities and inequalities. But expe- 
rience comes with practice, so let us delve into some exercises and problems 
to see how things really work. 


1.1 The relation a? +0? +c? = abc +4 


We start with an easy exercise, based on the resolution of a quadratic 
equation. 


1. Prove that if a,b,¢ > 0 satisfy {a? +b? + ¢? — 4] = abe, then 
(a — 2)(b-2) + B—2(e—2) + (c~ Bla ~2) DO. 
Titu Andreescu, Gazeta Matematica 
Proof. If max(a, b,c) < 2, then everything is clear, so assume that 
max(a, b,c) > 2, 


Then a? + 0? +c? — abe = 4, so there exist positive numbers 2, 7, z such that 
xyz = land 


1 1 xa 
@=2+-, b=ytm, c= zte—. 
z y z 


But then a,b,c > 2 and we are dene again. 


Proof. The most natural idea is to consider the hypothesis as a quadratic 
equation in a, for instance. It becomes a? + abc + 6? +c? ~ 4 = 0, and solving 
the equation yields 


1.1. The relation a? + b? + c? = abe+4 3 


Thus (8 — 4)(c? -- 4) = (be + 2a)®, which can also be written as 


(bc + 2a)? 
b—2)}(¢ - 2) = --- > 0. 
(- N(e-2) = ET e+y 2° 
Writing similar expressions for the other two variables, we are done. oO 


The following exercise is trickier and one needs some algebraic skills in 
order to solve it. We present two solutions, neither of which is really easy. 


2. Find all triples x,y,z of positive real numbers such that 


ety te =nyz+4 
wy + yz + ze = Ue -+ytz) 


Proof. By the second equation we have max(z, y, z) > 2 and so the first equa- 
tion yields the existence of positive real numbers a, ,¢ such that 


and abe = 1. i 

1. oe i 

Yo (o+ +58) = 250 («+ *). 
Since abc = 1, we have 
1 omy 1 
vl-Da Cb-ye 

so the second equation can be written 

ab os 

s (j he 2) =? Soa a Se ab. 


The left-hand side is also equal to 


> c(a? +b?) = (Ss a) os ab) ~ 3, 
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because ; 24K 
a,b _ ath a ye 
gta ab = c(a* + b*). 
We deduce that (S7a@—1) (> ab— 1) = 4. Since Ja > 3 and Slab > 3 (by 
the AM-GM inequality and the fact that abe = 1), this can only happen if 
a=bh=ec= 1 and thus when ¢ = y = 2 = 2. Oo 


Proof. If x+y = 2, the second equation yields xy = 4, so that (ey)? = ~12 
which is a contradiction. Thus a + y ¥ 2 and similarly y+ 2 42, 2+a 4 2. 
The second equation yields 


4—xy 
r+y—2° 
and a rather brutal insertion of this expression in the first equation gives 


4— 2 (4—-«y)* 
@-yt+ (4 |) ear 
gE+y—2 2-2-y 


Unless « = y = 2, this implies the inequality 2 > a+ y. H two of the numbers 
x,y,z are equal to 2, then trivially so is the third one. If not, the previous 
argument shows that 2>2+y,2>y+zand2> 2+ 2, But then the second 
equation yields 


styte> Doe(4*) = ryt ys+ en = Wnty +z), 
cue 


a contradiction, Thus, the only solution is ¢ = y = z = 2. 


The following problem hides under a clever algebraic manipulation a very 
simple AM-GM argument. The inequality is quite strong, as the reader can 
easily see by trying a brute-force approach. 


3. Prove that if a,b,c > 2 satisfy a? + 0? +c? = abe +4, then 


atb+ctab+actbe> 2/atb+ot+3 (+e +e — 3), 


Marian Tetiva 


1.1, The relation? +8 +e = abe+4  _ 5 


Proof. The hypothesis yields the existence of positive real numbers , y, z such 


inet x yz or 218 
of ¥ b= ¥ +4 c d : 


a 2° y gO ae 
The miracle is that both sides of the inequality have very nice factorizations. 
For the right-hand side, this is easy to observe, since 


Qs 


: Py ail? cc 
atb+e4+3 = (xy t+ yz + zz) oy ue tae 


and 


#+8+2-3-5 (544) +3 = (02) (Sa). 


For the left-hand side, things are more subtle, but one finally reaches the 
identity 


atb+ctab+be+ca= (So) (x) 4 (=a) (Soa). 


The desired inequality becomes simply the AM-GM inequality for two num- 


bers! o 


The following problems are rather tricky mixtures of algebraic manipula- 
tions and elementary number theory. 


4. Find all triplets of positive integers (k,1,m)} with sum 2002 and for which 
the system 


Pi Voy 
y « 
Do a9 
zy 

x 
—~p-s=<m 
a z 


has real solutions. 
Titu Andreescu, proposed for IMO 2002 
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Proof. The system has solutions if and only if 
P4P 4m? = tka +4. 
An easy computation shows that this relation is equivalent to 
(k+ 2)(L+ 2)(m +2) = (k4+14+m+4 2). 


As k+l+m = 2002, we deduce that any solution of the problem satisfies 
k+l+m = 2002 and 


(k + 2)(L + 2)(m + 2) = (k4+14+m +42)? = 2004? = 24. 3? - 1677. 


A simple case analysis shows that the only solutions are k = 1 = 1000,m == 2 
and its permutations. The result follows. Oo 


5. Solve in positive integers the equation 


(a + 2)(y + 2)(2 +2) = (e@+y4+242)*. 
Titu Andreescu 


Proof. A simple algebraic manipulation shows that the equation is equivalent 
tox*+y?+z? = zyz+d and, seeing this as a quadratic equation in z, we obtain 
the equivalent form (a? —4)(y?~ 4) = (wy—2z)*. If x? < 4, then y? < 4 as well 
and so ¢ = y = 1, yielding z = 2. If = 2, then y = z. In all other cases, we 
can find a positive square-free integer D (which is easily seen to be different 
from 1) and positive integers u,v such that «2 — 4 = Du? and y? ~ 4 = Dv’. 
Thus, solving the problem comes down to solving the generalized Pell equation 
a’ — Db® = 4, which is a classical topic: this equation always has nontrivial 
integer solutions and if (ag, bp) is the smallest solution with ag, bg > 0, then 
all solutions are given by 


ag + bo VD ig ag ~ by VD " 
ooh meray RE 2 (coe 


11. The relation a? +b? + =abe+4 ~ 7 


b= ag + by/D a! a9 ~ by VD i 
"VD 2 2 ; 


Part of the following problem can be dealt in a classical way, but we do 
not know how to solve it entirely without using the trick of substitutions. 


mi 


6. The sequence (dn)n>0 is defined by ag = a1 = 97 and 


O@n¢1 = AnOn-1 + y/ (aq — 1). — 4) 


for all n > 1. Prove that 24,97 9a, isa perfect square for all n. 
Titu Andreescu 

Proof. Writing 

(@nia = OnOn—1)" = (a3 ry (apy ~1) 
and simplifying this expression yields 

Oy a +24) — 2ontn1Gnai = 1, 
thus 

(2an—1)° + (2an)? + (2an41)? re (2an—1)(2an)(2Gn41) = 4. 


Since we clearly have a, > 2 for all n, this implies the existence of a sequence 
&y > 1 such that 2a, = 2, +271 and such that ¢n41 = &yC__1. Thus log zy, 
satisfies a Fibonacci-type recursive relation and so we can immediately find 
out the general term of the sequence (an),. Namely, a small computation 
shows that if we define a = 2+ V3, then 2, = a4», where Fi, is the nth 
Fibonacci number, Thus 


and so 


2 
24+ VIFQa, =24 (c+ a) (ah + 9) 
QuFn ihe 


The result follows, since a” +a™” € Z for all n, by the binomial formula. 
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Remark 1.1. Here is an alternative proof of the fact that all terms of the 
sequence are integers, without the use of substitutions. The method that we 
will use for this problem appears in many other problems. As we saw in the 
previous solution, the sequence satisfies the recursive relation 


2 2 2 
Ging HOH Opt 2An41AnOn—1 = 1. 


Writing the same relation for 2 +1 instead of n and subtracting the two yields 
the identity ; 
O42 = G4 = ndings (Qn42 — An~1)- 

Note that (a,)y is an increasing sequence (this follows trivially by induction 
from the recursive relation), so that we can divide by @n42 — @,—1 4 0 in the 
previous relation and get @n42 = 2Gn@n41 ~ G@n—1- The last relation clearly 
implies that all terms of the sequence are integers (since one can immediately 
check that this is the case with the first three terms of the sequence). Note 


square using this method. 

Remark 1.2. There are a lot of examples of very complicated recurrence 
relations that rather unexpectedly yield integers. For instance, the reader 
can try to prove the following result concerning Somos-5 sequences: let 
Q, = dg = +++ = G5 = 1 and let 

Gnt1Oned + Ong 20n+3 


On45 
an 


for n > 0. Then a, is an integer for all n. Similarly one defines Somos-6, 
Semos-7, ete sequences by the formulas 


ao = a, =--- = a5 = 1, 


ag =a, s+; ag=1, Gaye = sors na eT 


etc. One can prove (though this is not easy) that all terms of Somos-6 and 
Somos-7 sequences are integers. Surprisingly, this fails for Somos-8 sequences 
(in which case a17 is no longer an integer!). 


1.2. The relations abc => a+b+c+42 and ab+ be +ca+2abe= 1 9 


1.2. The relations abc =a+6+c¢+2 and 
ab +be+ca+2abe=1 


The first inequality in the following problem is very useful in practice and 
we will meet it very often in the following problems. 


7. Prove that if z,y,2 > 0 and zyz =a2-+y +242, then 
wy + yz+ 2a > WUa+y +z) and Wxt+ /yt+ V2 < ava. 


Proof. With the usual substitutions 


b+e , eta _ ath 
ant y= bd z ari 


the first inequality comes down (after clearing denominators and canceling out, 
similar terms) to Schur’s inequality 


a(a —b)(a — ¢) + b(6 - a)(b—c) + e(e~ a)(e— 8) > 0, 


while the second one follows-by adding up the inequalities 


fi. es a 
Vay Vote bte 3 \etal bee)’ 


The reader may find a bit strange the first method of proof of the following 
problem, but it is actually a quite powerful one. We will use again this kind 
of argument, see problem 11 for instance. Also, the third solution uses a very 
useful technique. 


im) 


8. Let 2,y,z > 0 be such that ay + yz + 22 = 2(x + y +z). Prove that 


ryz ety te+2. 


zabriel Dospinescu, Mircea Lascu 
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Proof. We will argue by contradiction, assuming that cyz > z+y+2+2. We 
claim that we can find 0 <r <1 such that X = re,Y = ry,Z = rz satisfy 
XYZ=X+Y+2Z +2. Indeed, this comes down to the vanishing of 


f(r) = Payz ~r(e+y +z) -2 
between 0 and 1, and this is clear, since f(0) <Q and f(1) > 0. Next, the 
condition cy + yz + 2r = 2(a + y +z) yields 
XY +¥Z4 2X =(X4+V4Z)<UX4+V4Z), 
This contradicts the first inequality of problem 7. Cl 
Proof. The condition can also be rewritten in the form 
(@~ 1-1) +Y-HeE-)+(-Yle@-N=3 
or in the form 
nyz —-&-y~z2-1=(e-1)\(y—-1(z~ 1). 
We will discuss several cases. If x,y,z > 1, then by the AM-GM inequality 
and the first identity, we get 
1> V@- iy =e -1P, 


which yields, thanks to the second identity, the desired estimate. 

If 2,y,2 < 1 or if only one of the numbers zg, y, z is smaller than or equal 
to 1, then {x — 1)(y — 1)(z — 1) < 0 and so zyz < e+ y+241 in this case. 
Finally, if two of the numbers are smaller than 1, say z,y < 1, the desired 
inequality can be written in the form 0 < «+ y+ 2(1 — xy) + 2, which is 
obvious. 


Proof. For three positive real numbers , y, 2 consider fixing the first two el- 
ementary symmetric polynomials oy = «+ y +z and og = cy + yz -+ zx and 
letting o3 = xyz vary. This amounts to varying only the constant term in the 
polynomial 


p(t) = ~ a1? + oat — 03 = (f~2)(t—y)(t— 2) 
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and defining x,y,z to be the three roots of this polynomial (in some order). 
Increasing 3, i.e. lowering the constant term, corresponds geometrically to 
lowering the graph. As we lower the graph, the smallest root increases, thus 
we maintain three positive real roots until the smallest root becomes a double 
root. If the double root is at t = a and the larger root at t = b, then we have 
0 = 2a+b, og = a? + Qab and 03 = ab. 

Tf we fix o) and o2 with og = 2c) as hypothesized, then we find b = oa 
and because 0 < a < b, we see that 1 < @ < 2. By the discussion above 
xyz = 03 < ab, so it suffices to show that 


2, .. 4- 
ab= 2a 


for 1 <a < 2. But this rearranges to (a—2)?(a? —1) > 0 and we are done. 


The technique used in the first solution of the next problem is rather ver- 
satile and the reader is invited to read the addendum 5.A for more examples. 


9. Let x,y,z > 0 be such that ry + yz + 22+ ayz = 4, Prove that 


WD aN 
3 (+ + a + =) 2 (e+ 2)(y + 2)(z +2). 


Gabriel Dospinescu 


Proof. Using the usual substitution 


aa tt 1p _ 2 
bee Oe a+b 


+ 


the problem reduces to proving the inequality 


nem 2 
bte (a+b+c)§ 
oe - ) ee ES CELE GE 
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This is a quite strong inequality and it is easy to convince oneself that 
most applications of classical techniques fail. However, the following smart 
application of Hdlder’s inequality does the job: 


a 2 
b+e 3 
(= 2x2) -(Sab+o") 2 (Lo+a), 
so it is enough to prove that 
3] [le +4) > 250 afb +)”. 

This reduces after expanding to S\a(b — c)? > 0, which is clear. Oo 
Proof. First, we get rid of those nasty square roots, via the substitution 

ry = 4c”, ys = 4a’, za = Ab. 


Then 


and replacing these values in the inequality yields the equivalent form 
3(a+ b+)" > 16(a + be)(b + ca)(c + af). 


The hypothesis becomes a? + 6? +c? + 2abc = 1, so that there exists an acute- 
angled triangle ABC such that a = cosA, 6 = cosB, ¢ = cosC. Next, 
observe that 


ct+ab=cwsC + cos A+ cos B = —cos(A+ B)+ cos A+ cos B = sin A+ sin B. 


Using this (and similar identities obtained by permuting the variables), the 
desired inequality becomes 


V3 3 cos A > 4] [sin A. 


Using the well-known identities 


#=4R] cos 4, r aR] sin 4, Vos A =14+5, 
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the inequality becomes 


r+R_ 2rs 

ane Shits eclaas 

v3 R > RR? 
or (R+r)RV3 > 2rs. This splits trivially into R+r > 3r (Euler’s inequality) 
and s < a¥8 R, the last one being well-known and easy. im) 


Here is yet another easy application of problem 7. 


10. Let u,v,w > 0 be positive real numbers such that 


wbotwt fuew = 


jes v fou 
+/2+/2 bald — SPutvatw. 


China TST 2007 


Prove that 


Proof. To get rid of those nasty square roots, let us perform the substitution 


V UY fow V uw 
a, - b. 
ay u 


Then u = be,v = ca i: w= ab, so that the = becomes a +6+¢ > 
ab + bc-+ca and the hypothesis is ab + bc + ca + abe = 4, Another substitution 


2 2 
a@=-, bs~, c= 2 
z y z 
yields xyz = 2+ y+ z-+2. We need to prove that zy + yz +22 > Aetyt+z), 
which is the first component of problem 7. 


The following problem has some common points with problem 7 and one 
can actually deduce it from that result. But the proof is not formal. 


il. Prove that if a,b,c > 0 and « =a+ i y=b+ Ege ds then 
¢ a 
ay + yz + 2x > A(a+yt 2). 


Vasile Cartoaje 
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Proof. The method is the same as the one used in problem 8. The starting 
point is the observation that ryz > 2+2+y+ z. Indeed, 


i iit : 
= abe + — $ - ~ peti > i rs 
BYZ abe + tate cite : gt gzttetyte 


Let us assume for sake of contradiction that xy + yz + z@ < 2(a+y +2) and 
24+X+¥+4+Z. This equality is equivalent to raayz == 2+ r(x+y+z) and such 
r exists by continuity of the function f(r) = r32yz—2—r(e+y+2) and by the 


fact that f(1) > 0 and f(0) < 0. The hypothesis cy + yz -+ zz < 2(a+y+ 2) 
can also be written as 


XY +YZ4+2Z2X <2(X+V4Z) <2X+Y + Z). 


This contradicts the first inequality in problem 7. 


Proof. As in the previous proof, we obtain that ayz >2-+y +2+2, Since we 
obviously have min(xy,yz,2z) > 1, this implies that z > aaa and similar 
inequalities obtained by permuting the variables. Next, we have 


1 1 1 
atytematot+bt ep tet = 26. 


In particular, there are two numbers, say 2,y, such that r+y > 4. The 
inequality to be proved is equivalent to z > Eee so we are done if we 
can prove that 

2+aty S 


ay-1 ~ x+y-2 


But this is equivalent, after an easy computation (in which it is convenient 
to denote S = x+y and P = zy), to (2y—y—az)? > (2 ~— 2)(y — 2). If 
(z ~ 2)(y ~ 2) < 0, this is clear; otherwise z,y > 2asr+y > 4. Let 
u= g—-2,.y = y—2, then the inequality becomes (uv + u+v)? > w, with 
u,v > 0 and it is obvious, 0 


The form of the following inequality strongly suggests the use of the 
Cauchy-Schwarz inequality. It turns out that this approach works, but in 
a rather indirect and mysterious way, which makes the problem rather hard. 
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12. Prove that for all a,b,e > 0, 


(b+e-a)? 
(b+ 0)? + a? 


_fera-b) | (at+b—c)*? _ 3 
; p25 


Japan 1997 


the problem asks to prove that for any positive numbers z,y,z such that 
eyz=xn+tytzt+2 we have 


(@-1P | (y-1P 


z+ yeti 


Applying the Cauchy-Schwarz inequality (or what is also called Titu's lemma), 
we obtain the bound 


(@=1)? | (y-1)? | (2-17 , (wt 


ear yer tat 2 ay 


+23) 
+27 +37 


ene. 


It. remains to prove that the last quantity is at least 
P= xy + yz + zx, so that the inequality becomes 


. Let S=au+y+z and 


Ss 


Notice that as $+ 2 > 2 and cyclic permutations of this, we have S > 6, Also, 
by problem 7 we have P > 2S. Therefore, 


Remark 1.3. There are other solutions for this problem: some of them are 
shorter, but not easy to find. Here is a particularly elegant one, based on 
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the linearization technique: the mequality is homogeneous, so we may assume 
that a +6+¢= 1. We need to prove that 


— 
| 
a 
= 
cc 
+ 
g 
ne 
on 


ay. 2 * . ’ 
The point is to bound from below 7h ae —; by an affine function of a, suitably 
(i-a)}*+« y 
chosen. The best choice is the following 


(1 ~ 2a)? 69 5da t\7 1 
=e ese =e fee hs 
Glare 7 3 aq} (e+) 2° 


Of course, the question is how one came up with something like this. Well, it 
is actually very easy: we need constants A, B such that 


> Aat+B 


for all a € [0,1], with equality for a = 3. Imposing also the vanishing of the 
derivative of the difference between the left and right-hand side at 4 yields the 
desired constants A,B. Once we have the previous estimates, it is very easy 
to conclude by adding them and taking into account that a+b+e=1. 


13. Find all real numbers & with the following property: for all positive 
numbers a, 6,c the following inequality holds 


a b ¢ i\° 
oo ia — >tk+-}). 
(« b+ :) (« = (: a+ :) = (« 3) 


Vietnam TST 2009 


Proof. Take first of all a = b = 1 and arbitrary ¢ to get that 


(e+ Ay) (82 (+5) - 
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Letting c -+ 0, we deduce that any & as in the statement must. satisfy 
; i\8 
k(k+1P>[k+=). 
3) 
which is equivalent to 4k? + 2k > 1. We claim that any such k is a solution of 


the problem. 
Pick such & and perform the usual substitution 


to reduce the problem to 


Ro te 3 
BaP ety tat kay + yz + 22) 4+ 2yz > (« + 3) F 


Now, we know that xy + yz + zx + 2xyz = 1 and it is by now a classical fact 
(use problem 7 for 1/x,1/y,1/z) that x+y +2 > 2(xy + yz + 2a). Thus, it 
is enough to ensure that 


% _— 7 3 
K+ (24? + b)(ay + yz +20) + can Hrs (« ) 


This can be rewritten as 


4 1 : 
(2 tk~ 5) (+v + yz + ze — i) 20. 


But the last inequality follows from the fact that 2k? + & — i > 0 and 


LY ye + 2a > LP 
which, by returning to the substitution. 


C 
z 


“ath 


be 
is equivalent to 7 a(b — ¢)? > 0. 


In conclusion, the solutions of the problem are the real numbers k such 
that 4k? 42k > 1. D 
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We end this section with a very challenging problem, which answers the 
following natural question: what would be the analogue of the classical sub- 
es = ake when we have five variables? We warn the 


stitution # = “2 : 
reader that the first solution is really not natural... 


14. Let ay,ag,...,@5 be positive real numbers such that 
ayag...a5 = ay (1 + a2) + ag(1 + ag) +--+ a5(1 t+ ay) + 2. 


What. is the least possible value of 2 + x tet 2? 


Gabriel Dospinescu, Mathematical Reflections 


Proof. Consider the linear system 


x2 + ari, wate, = Ag%Q, Ta + Ip = A3H3, 


5 +H, = 4X4, T1 + Lo A5Xg. 


We can try to solve this system by expressing, for example, all variables in 
terms of x4,%%. Namely, we can use the fifth, first and second equation to 
express £2, 23,04 in terms of 24,25. Replacing the obtained values in the 
other two equations and eliminating 71,75 between them, we obtain that the 
system has a nontrivial solution if and only if 


ie 


5 5 
an 
Tl« = 2+ S G4 + a1A4 + 442 + A205 + A503 + O301- 


All the previous (painfull) computations are left to the reader, since they 
are far from having anything conceptual. Note that the same result can be 
obtained by computing the determinant of the associated tnatrix. Now, the 
previous relation is almost the one given in the statement, up to a permutation 
of the variables. So, since the conclusion is symmetric in the five variables, we 
will assume fromm now on that the previous relation is satisfied instead of the 


one given in the statement. 


1.2. The relations abe = a+b+¢+42 and ab+be+ ca + abe 


19 


The crucial claim is that under the previous hypothesis, the system has 
solutions whose unknowns are positive. Probably the easiest way to prove 
this is to exhibit such a solution. Well, simply take a; = 1, . 


ae = Lb tan + 3 + a1ag 
B53 = 


1+ G5 + a@3a5 + aga5 
and the define 


_ &4 + Xs 


, t= — 
a3 ag 


The question is, of course, how on earth did we choose the value of «rs? Well 
simply by solving the system, as indicated in the beginning of the solution, 
Note that we clearly have x; > 0 and an easy computation shows that these 
are solutions of the system (we only have to check two equations, since three 
are satisfied by construction). , . 
The conclusion is that if the a,’s satisfy the condition 
5 


5 5 
I a= 2+ DE Gj + 144 + A402 + a2@5 + agag + agay, 
i=l 


tex 


then we can find 2; > 0 such that 


Lg * 2g 3b x, ’, 
a a SL en 

wy £2 7 x5 
So, the problem reduces to finding the minimal value of 


ry x2 


fatty «eg+24 


We claim that the previous expression is always at least 5/2. By Titu’s lemma 
this reduces to proving that 


eh ato perdades eats 2S 
(x1 + ao +23 + 24+ 25) 2 pny 


But since 


23 x0; = (Sox) -~ Da, 
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this reduces to (Soa)? <5 Yo 27, which is Cauchy-Schwarz. 
Putting everything together, we obtain that the minimal value of 


is 5/2, attained when all a;’s are equal to 2. Who wants to play now the same 
game with 7 variables? 


Proof, Here is another proof, also far from being evident... We will use the 
following nontrivial 


Lemma 1.4. For all nonnegative real numbers £1,%2,...,%5 we have 
(0) # Bg +++: + ts)® 2 25(wraews + waegry + +++ + 5172). 


Proof. This is not easy at all, but here is a very elegant (but unnatural) proof: 
consider the identity 


wy eeng + Lgnghs + LgH40H Hp T4Cgl1 PF LHxALQ 
x= x5(ay + .23)(ag +04) + epea(ay + £4 — 25). 
We may assume that x5 = mina; (since the inequality is eyclic), so that 
ty +44—-%5 > 0, Denoting 2 + #2+ 43 +24 = 4t and using the AM-GM 
inequality, we deduce that 
_ (S22. i 
3 : 


ag(ay +23)(@e + €4) + taes(a1 + ta a5) S 4tP x5 


Thus, it remains to prove that 


By homogeneity, we may assume that 25 = 1 and then expanding everything 
the inequality becomes, with the substitution 4t—1 = 3x, (e— 1)*(82+7)} > 0, 
which is clear. a 
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Using this lemma for the inverses of the @;’s and denoting 


we deduce that 


) ———— 


AjO42 142 


On the other hand, by Mac-Laurin’s and AM-GM inequalities we also have 


nds 
Taking into account these inequalities and the hypothesis, we deduce that 


st gs% ag5 
get get ge 2h 


which immediately implies that S$ > 8. oO 
1.3 The relation a? + 6 +c? + 2abe = 1 


The following problem is a rather tricky application of the AM-GM in- 
equality. 


15. Prove that in all acute-angled triangles the following inequality holds 


cos A fe cos B ee cos)? 
coeB cos 4) + 8cosAcos BeosC > 4. 


Titu Andreescu, MOSP 2000 


Proof. Since 


cos? A + cos? B + cos? C + 2.cos Acos B cos C = 1, 
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the inequality can be written 


2 2 2 
208 cos B 20S F 
(; os 3) t € ate ) 2 (= <) > A{cos? A + cos? B + cos? C). 


cos B cos C cos A 


Let 
= cos? A, y=cos*B, z=cos’C, 


so we need to prove that 


242 a at y sy). 
ar oe: 


The point is that we actually have 


for any positive real numbers z,y,z, as follows by adding up the AM-GM 
inequalities 


Thus, it is enough to prove that ryz a. But. this is well-known and 


easy to prove. G 


IA 


The following problem is a preparation for a hard problem to come, but 
has also an independent interest. 


16. Prove that in every acute-angled triangle ABC, 
(cos A + cos B)? + (cos B + cos C)? + (cos C + cos A)? < 3. 


Proof. Letting 


yee ae ee Ce ee 
cos A set cos B Via cos C Vin 
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for some positive real x,y,z, we obtain the relation ryz = 2+y+2+2 from 
the classical identity ST cos? A+ 2] cos A = 1, Thus we can find positive real 
numbers a,b, ¢ such that 


_ bee _ eta eat 


a? 9 b? ¢ 


The desired inequality can be written 


Day t Day Sa ott Devas ae 
= at DvwS5 = (ety+24+2) = (Siva) <2($72+3). 


This follows from Cauchy-Schwarz 


(Eva) <(De+0)-(C!)=2De+ 


Proof. Since the triangle is acute, there are positive numbers z, y, z such that 


QO 


‘Then 


and similar identities for cos B, cos ‘ The pie, inequality can be written 
as 


ee 


erica wiaVerners © 


which (after clearing denominators) is equivalent to 


Y (@e+a+eve 


and then to 


LVEF EFF < toy +5 valu +2). 


eye eye 


)) < (2 +yyt2)(z+2) 
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However, this follows immediately from the AM-GM inequality: 


and two similar inequalities. oO 


Even though the following problem seems classical, it is actually rather 
hard. Fortunately, we did the difficult job in the previous problem. We 
also present an independent and very elegant approach due to Oaz Nair and 
Richard Stong. 


17. Prove that if a,b,¢ > 0 satisfy a? + b? +c? + abe = 4, then 


0 < ab+ be + ca — abe < 2. 


Titu Andreescu, USAMO 2001 


Proof. The inequality on the left is easy: the hypothesis and the AM-GM 
inequality imply that abc < 1, so that min(a,6,c) < 1. We may assume that 
¢ = min(a, b,c), so that 


ab + be + ca ~ abe = cla + b) + ab(1 — ¢} > 0. 


The hard point is proving that ab-+be+ca~abe < 2. Taking into account 
the hypothesis, this can also be written as 


a+b\” b+e\? ate\? 
picaiilanty ats 
(a) Ca) Ha) 


If we denote a = 22,6 = 2y,c = 2z then a? + y? + z? + 2ayz = 1 and so 
there exists a triangle ABC such that @ = cos(A),y = cos(B),z = cos(C). 
Thus, the problem reduces to the previous one (since a,6,¢ are nonnegative, 
the triangle ABC is acute angled). i) 


Proof. Here is a very elegant proof for the hard part of the inequality. Among 
the three numbers a~1,b~-1,¢-1, two have the same sign, say b~1 and c-1. 
Thus (1 ~- &)(1 ~ c) > 0, implying that 6+ ¢ < be + 1 and ab+ ac — abe < a. 
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Thus, it is enough to prove that ¢ + bc < 2. But the given condition and the 
fact that @ is nonnegative imply that 


Thus, it is enough to prove that /(6? ~ 4)(c? = 4) < 4—be. Note that be < 4, 
since b? < 4 and ¢? < 4. Squaring the previous inequality thus yields an 
equivalent one 

BP? » 4b” — de? + 16 < b*c? — Bbe + 16, 


which is obvious. o 
We end this chapter with another challenging problem, which reduces after 
some tricky algebraic manipulations to the infamous “Iran 1996 inequality.” 
18. Prove that in all acute-angled triangles the following inequality holds 
ab? 202 Be’ 


@ c 
— _ 9R?. 
Re a2 


Nguyen Son Ha 


Proof. If A, B,C are the angles of the triangle, the sine law and the identity 
sin? 2 + cos*@ = 1 yield the equivalent form of the inequality 


Sues (1 ~ cos* A)(1 — cos? BY 
1- er ce 


Write 
cos*A=yz, cos*B= 20, cos?C = zy. 
Then zy + yz + 2a + 2xyz = 1 and so there exist positive numbers X,Y, Z 


such that 
x Yy Z 


Vag 8 xg OP Oey 


We want to prove that 


(l~yz)(l-24) 9 
se 1l-ay 2 4 
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On the other hand, we have 
lays aed oa 


so that the inequality is equivalent to 


AY(X+Y4Z) s 9 
ZAX+Y)2 74 
This can also be written in the form 
1 9 
ZS pn > 
XYZ(X+¥+2Z)-S° GX p ave 24 


and it is a consequence of the following famous inequality 


Lemma 1.5. For all positive numbers a,b,c we have 


1 9 
(ab + be + ca) (eng: + ne we 4 Ge 7) > i 


Proof. We may assume that a > b > c. First, we show that 


i I 1 1 2 


(a+b? ° (ato)? 7 (b+)? 2a (a+e)(b+e)° 


This can be rewritten 


ee ae ee Ce 
ate b+e} ~ dablat bj?’ 


or equivalently 4eb(a+b)? > (a+c)?(b+c)?. This is clear, as (a+b)? > (a+c)? 
and 4ab > (b+ c)*. 
Thus, it remains to prove that 


Te. «. 2 9 
(ab + be + ca) las + Gt SETA 2 
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Using the identities 


abtbe+ea 1 catb)  2Aab+ be + ea) 27 
4ab 4° dab * (at e\(breo) {a+ chb+e) 
this becomes 2 
c{a + 6) s 2c? 
4ab ~ (a+c)(b +e) 
But this is simply the standard inequality (a + 6)(a+c)(b+ ¢) > 8abc and we 
are done. 5 
This completes the solution to the problem. o 


1.4 Notes 


We would like to thank the following people for their solutions: Vo Quoc 
Ba Can (problems 13, 18), Xiangyi Huang (problems 4, 5), Logeswaran La- 
janugen (problem 1, 9), Oaz Nair (problem 17), Dusan Sobot (problems 2, 9, 
16), Richard Stong (problems 8, 17), Gjergji Zaimi (problems 2, 3, 6, 7, 8, 11, 
12, 15, 16, 17). 


Chapter 2 


Always Cauchy-Schwarz. .. 


As the title suggests, all problems in this chapter can be solved using the 
Cauchy-Schwarz inequality, even though sometimes this will require quite a 
lot of work. Let. us recall the statement. of the Cauchy-Schwarz inequality: if 
@1,@2,...,Gn and by, bg,...,6p are real numbers, then 


(aj +a} +---4+02)- (2+ bb to + 2) > (ayby + aghy Bee + nbn). 


«. Indeed, the left-hand side is a quadratic function of x with nonnegative 
values, so its discriminant is negative or 0, But this is precisely the content, of 
the Cauchy-Schwarz inequality. Another proof is based on Lagrange’s identity 


(= 7) ; ( ‘ i) _ (Soa) = Se (azbj — a3b;)?. 
1 i=) 


i=l i= igi<jsn 


This useful identity will be used several times in this chapter. 

As the best way to get familiar with this inequality is via a lot of examples 
of all levels of difficulty, we will not insist on any theoretical aspects and go 
directly to battle. We start with two easy examples, destined to give the 
reader some confidence. He will surely need it for the more difficult problems 
to come... 
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1. Let a, 6,¢ be nonnegative real numbers. Prove that 
(ax* + bx + e)ex® + be +a) > (a+b + ¢)?x? 


for all nonnegative real numbers x. 


Titu Andreescu, Gazeta Matematica 


Proof. This is just a matter of re-arranging terms and applying Cauchy- 
Schwarz: 


(ax? + be + ¢)(cx? + be + a) = (ax? + be + c)(a + be + cx?) 
> (ax + ba + ca)? 
= (atb+ex, 


o 


2. Let p be a polynomial with positive real coefficients. Prove that if 


1 ear rt 
p G > ——\ is true for x = 1, then it is true for all z > 0. 
z} ~ pa) 
Titu Andreescu, Revista Matematica Timisoara 
Proof. Write p(X) =* ag + a@,X +--+ +@,X” and observe that 


an 


1: a 
plx)p () = (ap tae toe + an2") (ao eae 


The result follows. 0 


The following exercise is already a bit trickier, due to the lack of symmetry. 


3. Prove that for all real numbers «, b,c > 1 the following inequality holds: 


Va-1+Vb—14 Ve—1 < Valbe+ 1). 
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Proof. The inequality being symmetric in 6,c, but not in a, it is natural to 
deal first with /6-1+/c~ 1. This is easy to bound using Cauchy-Schwarz: 


Vo-it+ve-1< Y@-14D0 1) = Vbe. 


So, it is enough to prove that Ybe + Va—1 < \/albe +1). But this is once 
more the Cauchy-Schwarz inequality. 9 


Another easy, but a bit exotic application of the Cauchy-Schwarz inequal- 
ity is the following Chinese olympiad problem. 


4. Let n be a positive integer. Find the number of ordered n-tuples of 
integers (a1, a2,...,@,) such that 


a, +a2+---+a_ > 7? and a? tate tare cn $1. 
China 2002 
proof By the Cauchy-Schwarz inequality we have 
ay tag tes tan S Jnl +0) <n? +1. 
Since a; are integers and a; + a2 +--+ +a, > 7, we must have 
ay tage tay = n?. 


But then (again by Cauchy-Schwarz) we have af + a3 +-+-+a2 > n¥, forcing 
af +ag+---+a2 € {n3 nF 41}. If af+ag+---+02 = n3, then we must have 
equality in Cauchy-Schwarz, implying that all a;’s are equal to n. Assume 
now that a? + a3 +:+-+a2 =n'+1 and let bj = a;~n. Then 


B+ B+. +O and +1~In-n? 473 =1, 


forcing all but onc 6; vanish. This is however impossible, as 6j+b2+---+-b_ = 0. 
Therefore, this second case will not occur and the only solution is 


a, = 42 = 0+ =a, =n. Oo 


We continue the series of easy exercises: 
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5. Let ai,a2,-...219 be real numbers between 0 and 5 such that 


sin? 2; + sin? zg + +++ +sin? x19 = 1. 
Prove that 
3(sin 2; + sin zg ++ +++ sin ay) < cose] + cosxg + +--+ coszyio. 
Saint Petersburg 2001 


Proof. The Cauchy-Schwarz inequality and the hypothesis yield 


1 ‘ A 
> gain ag + sings +--+ +sinwig) 


and similarly for the other variables. The result follows by adding up these 
inequalities. : og 


The following exercise combines an easy application of the Cauchy- 
Schwarz inequality with some classical formulae from geometry. Recall that r 
is the inradius and that s is the semi-perimeter of a triangle. 


6. The triangle ABC satisfies 


ee 2cot 2 *t sear) = (8) 
cots ai cot 5 + Scot 5 = lay: 


Show that ABC is sirnilar to a triangle whose sides are integers, and 
find the smallest set of such integers. 


Titu Andreeseu, USAMO 2002 


Proof. Let the incircle be tangent to the sides of the triangle at points which 
split the sides into segments of length x and y, x and z, and y and z. Thus, 
the sides of the triangle have lengths «+y, 2+ 2, y+ 2. 
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The equation 


This simplifies to 
49(z7 + dy? + 9a") = 36(2 + y + z)*. 
Now, by Cauchy-Schwarz we have 


1] P 3 . 
( tar 5) (27 + 4y? + 92") > (@ ty +2), 


yielding 

49(2? + 4y? + 907) > 36(a + y + 2). 
Thus, we are in the equality case of the Cauchy-Schwarz inequality, which is 
precisely the case when there is a k > 0 such that « = 4k,z = 36 9k, 


Then the sides of the triangle are 13k, 40k, 45k. Thus the triangle is similar 
to the triangle with sidelengths 13,40, 45, and the problem is solved. 0 


The statement of the following problem looks rather classical. There are 
however some technical problems which make the problem more difficult than 
expected. 


7. Let n > 2 be an even integer. We consider all polynomials of the form 
2" + Gy_y2") +--+ a,x +1, with real coefficients and having at least 
one real zero. Determine the least possible value of a + a3-+---+a2_,. 


Czech-Polish-Slovak Competition 2002 
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Proof. Suppose that the corresponding polynomial has a real zero «. Using 
Cauchy-Schwarz, we obtain 
(a” +1)? = (ae + age 4 + aye YP 
S(t abt--+ + a2_y (a? tat ye. $22), 
Thus 


at +i 2 
ota} tad > a Ft = sea) 


and so we need to find first the minimal value of f. Now, looking at the 
zeros of the derivative of f suggests that the minimal value might be taken at 
zx = 1, so that it equals =*;. This is not easy to prove using derivatives, as 
the computations are a ditt nasty: Instead, we will prove in an elementary way 
that 


n—-1 
4 
In order to prove this, note that we have the inequalities 


(2 +1)? >a? tate pend, 


g+i>etat, etal eatea™ 4... et +i zat ye’. 
Adding them shows that 


= Q ig 
Se" + 2at+ets poe 4 02, 

On the other hand, multiplying the last inequality by 2” and adding the result 
to the previous inequality gives the inequality 


n~ e= 2m ay > phe bah PHP pp gD, 


Thus it remains to prove that (2” + 1)? > 4a”, which is clear. 
The previous paragraph shows that 
eter 
my + a9 cr 
if 2” +an-12" 1} +++-+a,2+1 has at least one real BerOs On the other hand, 
choosing a; = ag = +++ = Gp) = —=%; shows that 4, is optimal. ia 


wT 
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The denominators in the following inequality look awful, but a clever 
application of the Cauchy-Schwarz inequality can make all of them equal. 
The method of proof is worth remembering, since it appears quite often. 


8. Prove that for any positive real numbers «,y,z such that cyz > 1 the 
following inequality holds 


¥ 2 


_ pits < 
ryt bt Bie Wyaity 2! 


Tuan Le, K6MaL magazine 


Proof. Using Cauchy-Schwarz, we obtain 


i 
(2° +9 +2) (3 +142) > (e@+y+2)?. 


Thus 
£ etgtlt+s) 
w+yete™ (cty+z2)?" 


Writing down similar inequalities for the second and third terms of the left- 
hand side, we reduce the problem to proving that 


Didtete) s@tyt2 oe Set eye 3+ 2. 


This is immediate from 


Sey > 3 (xyz)? > 3 


and 


2. (@+y+2)? 
ve Gtyts 2 Yryz(ztyt+z)>aet+ytz, a 


Here is a rather nice-looking inequality taken from a Romanian Team 
Selection Test. ‘There are plenty of ways to prove it, but what follows is 
particularly elegant and natural. 
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9. Let n > 2 and let a1, @2,...,@, and 61, bg,...,6, be real numbers such 
that 
G++ o¢a = Rt Bt +B wl 


and a1), + agbg + +++ + dnb, = 0. Prove that 
(ay Fag bees + ay)? t+ (br the be + ba)? Sn. 
Cezar and Tudorel Lupu, Romanian TST 2007 


Proof. The proof mimics the proof of Cauchy-Schwarz: take any real number 
x and apply the Cauchy-Schwarz inequality to obtain 


> (e428) arbj)? > 5 oie =i‘ a ay bi)? 


i=] 


Tt 


By hypothesis, the left-hand. side equals 1 + x?, Therefore 


(Sa 5h) <i < n(x? +1) 


for any real number a. The difference between the right-hand side and the 
left-hand side being a quadratic polynomial which takes nonnegative values 
on the whole real line, its discriminant has to be negative or zero, thus 


4A? B? < 4(n — A*)(n — B?), 


where A = 37, a; and B = S72., by. It is immediate to check that this is 
equivalent to the desired inequality. CQ 


The following problem is easy, but the lack of symmetry might make it 
appear more difficult than it really is. 


10. Find the largest real number T with the following property: if a,b, ¢,d,¢ 


are nonnegative real numbers such that a + 6 =c+d +e, then 


Vetere ¢ete >T(fa+ vot Vet Vd4 ve). 


37 


Proof. This is a simple application of the Cauchy-Schwarz inequality. Namely, 
we have 

2 

a+h > md , etdperr Crete)? _ (eb? 

2 = 3 


so that. 


On the other hand, 


Vatvi< /2(a + by, Ver Vd + es V3(c+d+e) = 


therefore 


(Vat Vb+ Ve+ Vd+ Ve)? < (V2 + V5)%(a +8). 


Combining these two inequalities yields the estimate 


@+h+ e+ P+ ea>— YY aa vh vd 7 
ot Vved 
a+ Van ve b+ ve + Vey’. 
To see that this is optimal, it suffices to keep track of the equalities in the 
previous inequalities. For instance, we can take a = b = 3 and c= d=e = 2. 
Thus the answer is T' = aw G 


The next problem requires a whole series of applications of Caucl lay- 
Schwarz. 


li. Let 21, 22....,a be positive real mumbers such that 


t 1 1 


wo pt po 
T+aq 


T+a, lta 


Prove the inequality 


Vii + Via +-++ Vin > (n~1) \(Z 


Vojtech Jarnik Competition 2002 
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so that S> a; = land aj = Bett Then, using Cauchy- 


vi va= 5 a= 


Rearranging terms in the previous sum oo 


Dv Jet yet va) 


and using again ae we can bound this from below by 


(n- = va 
Fea > ay a me oe 


Using once more Cauchy-Schwarz for the pene of each fraction 


2 Van y + vast: + vt Van 
J(n— Iai 


Via + fag te + Jin < V(r Il@ + a3 +--+ + Gn) 
yields the desired inequality. Oo 


The idea of the following example is worth keeping in mind, since it turns 
out to be useful in a wide range of problems. 


12. For n > 2 let a1, a2,...a, be positive real numbers such that 
11 1 1\? 
(aj +49 +--+ Gn){— +o tet] S[n+5) - 
a a Gn, 2 
Prove that max(ai,a2,...,@,) < 4min(a1,a2,..-,@n). 
Titu Andreescu, USAMO 2009 
Proof. The idea is to fix two of the variables, say a), a2 and apply the Cauchy- 


Schwarz inequality to get rid of the remaining variables. Explicitly, this can 
be written in the form 


(n+}) 2 
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where the first inequality is simply the hypothesis, while the second one is 
Cauchy-Schwarz applied to n — 1 terms, grouping the terms indexed 1 and 
2 in each multiplicand. We deduce that (or-tasy* < 3. which immediately 
unplies that 

max(a;,@2) < 4min(ay, a2). 


Since everything is symmetric in @1,@2,-..,@n, the result follows. 


Oo 


A bit more difficult, but with a similar flavor is the following problem. 


13. Let n > 2 and let 2), 29,...,%p be positive real numbers such that 


1 1 
(ayant a9) (Z- oth) nate 
cat Ln 
Prove that 
9 1 1 d 2 2 
Died 2 i 2 
(a7 +29+ 10) (S+% +a) >n + Gat 


Gabriel Dospinescu 


Proof. The crucial idea is to write the hypothesis in a different way: expanding 
the product and rearranging terms shows that the hypothesis can also be 
written 


L ( / 
isi<jsn 
Let us write 
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Since 


2 
Ei 2p 
¢ is of 2) Soe tie 


the inequality to be proved becomes 


s (44+ tigltig > 4+ 


n(n — 1)" 
1<écj<n (n—1) 


which is equivalent (taking into account the hypothesis that the sum of all ¢;,; 


is 1) to 
@, 
u > ay 


i<i<jen 


This follows from Cauchy-Schwarz inequality, but there is still a detail to be 
explained: we have to show that. we cannot have equality. But if we had 
equality, all t;,; would be equal to we D and so all numbers = +% 4 would be 
equal. Since n > 2, this would force an equality 2; = 2, for some 5 ot k. But 
then ¢;4 = 0, a contradiction. a 

We continue with an inequality which combines a rather direct application 
of the Cauchy-Schwarz inequality with a nice telescopic identity. We also 
present a beautiful alternate solution, due to Richard Stong. 


14. Prove that for any real numbers 2), 02,...,¢, the following inequality 
holds: 
a, 22 ; A a 
; 5 5 ral a wa “se YT. 
L+2? Lae ead | "Late pad ue 


Bogdan Enescu, IMO Shortlist 2001 


Proof. The sohution is very short, but far from obvious. The idea is to use the 
Cauchy-Schwarz inequality to reduce the problem to 

eae 24 0 ee 
(L402)? © (1+ a? +23)? (1+ az+..++422)2 


Al 


Thus, we need to prove that for any ay,@....,¢, > 0 we have 


n 


es 
ft (Lay tes + aj)? > 


Define S$; = a) +a2+---+a;, with the convention Sy = 0. This is an increasing 
sequence and so 


cn Ge = Sy 

iaas + a;)? a= LG 14-5; 
L 

Lis 


= < 
T+ Sh 


il 


Here the first inequality is just 1+ A? +22 > 14 A? with equality if and 
only if ¢ = 0; the second is Cauchy-Schwarz with equality if and only if 
ave = V1 +A?. Thus we cannot have equality in both cases and the final 
inequality is actually strict. Applying this inequality repeatedly, it is easy to 
see by downwards induction on k that 


Bay 
i+ ae re 
1+? 
The case k = 0 is the desired result. oO 
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The trick of making the denominators equal thanks to a smart application 
of Cauchy-Schwarz or AM-GM inequality is also used in the following problem. 


15. Prove that if a, b,c,d are positive real numbers, then 


a b ce ad 4 


: Te ane ee 
BPSTR 2 ere eee Tet are $+? atrbt+etd 


P.K. Hung 
Proof. Using the AM-GM inequality in the form (x + y)? > 4zy, we obtain 


Se BO estes 
b+ c8 4 d? > (a2 +42 + a2)? 
Adding these inequalities yields 


ee >4 Ya 
POLE (Soa?) 


By Cauchy-Schwarz, 


Inserting this in the previous inequality yields the desired result (note that 
the inequality is strict, since otherwise the equality in the Cauchy-Schwarz 
inequality would yield a = b = c = d, for which the inequality is strict). Note 
that even though the inequalities we used were very rough, the constant 4 is 
optimal: simply take a = 6 and c,d close to 0. oO 


One needs rather good gymnastics with Cauchy-Schwarz to deal with the 
following problem. 


16. Let n > 2 be an integer and let 21, 2,..., 2%, be real numbers satisfying 
apt ah ee ae? + yee + worg + + Enid = 1. 


For a fixed 1 < k <n, find the maximum value that {z,/ can take. 
China 1998 
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Proof, Let us write the condition in the form 
ay + (ay + @2)* + (ay +23)? + (ayy an)? $22 = 2 
If we fix 1 < k <n and apply the Cauchy-Schwarz inequality twice, we obtain 
BY + (wy + ag)® +--+ (tent + ap)? 


yy (Pi = (1 + 22) + (a2 + 3) 0 + (IF tent +2x))? _ 2h 


ES k k 


and 
(ty + oeg1)P +e + (ep + tn)? + a 
yp (e+ het) ~ Cap + ep2) Fe — ea)* eh 
os nm~k+1 n-k+l1' 
Taking into account the hypothesis and these two inequalities, we deduce 


that 2 > aot for all k, so that 


nes 
f2k(n+1—k 
lel < f BOB) 
n 
By studying the equality case in the previous inequalities, one immediately 


sees that this value is attainable for each fixed k. Specifically, if we define 
2k(n+1— =f) 
eet 


Op = and take zp = 441 = 0 by convention, then equality occurs 
if (-1)*-42; interpolates linearly and evenly between these endpoints and 
Zp = ag. The explicit formula is 
(~1)R-F igs GSk 
a= ( 


~1p a 52k 


Another ingenious application of the Cauchy-Schwarz inequality can be 
found in the following problem. 


17. Let a,,¢ be positive real numbers. Prove that 


[ee oe 1 7fl 14 4 .9\2 
st+iut+até 2s (-4+5+-+——— 
a © b2 " ¢2 (atb+e~ (a? b ot abt) 


Tran 2010 
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Proof. We see that we have equality when a = b = ¢, so we have to apply 
the Cauchy-Schwarz inequality in a smart way for the left-hand side of the 
inequality. Namely, start with 


(fives) (oe ees : ) 
ee 9) \e PO a+b+e)" 


ey ane ers 1 i 
“\a be Batb+eo)) © 


This reduces the problem to showing that 


1 ky r 1 >i ii 
ab” Blatb+e)~ 15 \a' b 


Fortunately, this is equivalent. to the classical inequality 
fe Med Ss 9 


and we are done. o 


The form of the following problem strongly suggests using Cauchy- 
Schwarz. However, it is rather easy to check that many attempts fail... 


18. Let «,y,z be real numbers and let A, B,C be the angles of a triangle: 
Prove that 


esinA+ysinB+zsinC < /(1 + x?)(1+ y?)(1 + 27). 
Proof. Using that C = a — A — B, we obtain the identity 
asinA+ysin B+ zsinC = (x + zcos B)sin A+ zsin BoosA + ysin B. 


Using Cauchy-Schwarz and the fact that sin? A + cos* A = 1, we obtain 


(e+ zc0s B)sin A+ zsin Boos A < 4/ (x + zc0s B)? + 2? sin? B 


Another application of Cauchy-Schwarz thus yields 


zsinA+ysin B+ zsinC < va + y?)(a? + 2? + 2x2 cos B + sin? B), 
Thus, it remains to prove that 


a +2? 42zcosB+1~cos?B< lta? +22 42222, 


which is equivalent to (ez — cos B)* > 0, 


We now enter the zone of challenging problems, with an unusual one. 
The first solution seems to come from nowhere (well, it actually came from 
the author’s imagination...), but we also present a beautiful geometric proo' 
of Richard Stong, which makes things rather clear. 


19, Let: a,6,c,2,y, 2 be real numbers and let 
Asmar+tbhy+ez, B= aytbz+ex, C= az+ br + cy. 


Assuming that min(/A ~ B|,|B - Cl,i¢ — Al) = 1, find the smallest 
possible value of (a? + 8? + ¢?)(x? + y? + 2%). 


Adrian Zahariuc, Mathematical Reflections 


Proof. Note that by Lagrange’s identity and the Cauchy-Schwarz inequality 
we can write 


(a? +67 + 7) (a? + y? + 2?) 


= (ax + by + cz)? + (ay ~ bx)? + (bz — ey)? + (ce ~ az)? 


eee (ay ~ ba + bz — cy + ce ~ az)? 
= 3 
= 4? + \B-cP 

ae 


Since everything is symmetric, we obtain 
(a? + OF + Va? + y? + 2) 
—~cPr ~ Al? | 
eal BP4 e~ P o24l4 


> max (# + 


2) 
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Using the hypothesis, it is immediate to check that the last quantity is at 
least 4. 

To see that the answer of the problem is 4, it remains to find a 6-tuple 
(a,b,c, 4, y, z) satisfying the conditions of the problem and for which 


2. 4 4 
(2 +8? 4 2)(a2 +92 +22) = : 
Taking A = 1, B = 0,C = —1 (this is a triple which minimizes 


IB-CP po, IC-AP oo, Beer) 


, BPs : 
8 3 <3 3 


max (# + 


under the restrictions of the problem), we must ensure that we have equality 
in the Cauchy-Schwarz inequality. Solving the corresponding system yields, 
after some tedious but easy work, suitable values for a,b,c, z, y, z, namely 


b=0, c=-=, r=y=1, 2=-2. ‘= 


Proof. Let u be the column vector with entries (a,b,c), v the column vector 
with entries (x,y,z) and let R be the linear map 


x 610 x y 
Riy}=;0 01 yl = [2 
z 1060 z x 


Then A=ulv, B= ul Rv, C =u" R*v and we want to minimize {lull - ||ul!?. 
Note that R acts on R° by fixing the line x = y = z and rotating 120° in the 


Write w = ui, + ug, where w lies on the le ¢ = y = z and we lies in the 
plane «+ y+ z = 0 and similarly for v. Then 


lfrell? = Heer ff? + |feeg|[? and uP R¥y = uly; + uf Rey. 
1 


Thus the contributions of uj and v1 to A, B,C are all the same (hence cancel 
in |A ~ BI, etc.). Thus clearly the minimum occurs when u, = vy = 0. In 
this case, R acts as a 120° rotation on v, hence v + Ru + R°v = 0 and thus 


AT 


A+B+C=0. Recall that cos a+ cos(a + 27/3) + cos(a + 47/3) = 0 for any 
a. Tf we now take w to be the angle between u and v, then we compute 


A? + B? 4 C? = |lull? - |ful]?(cos® w + cos?(w + 2/3) + cos*(w + 4/3) 
1 : 
oe sie? + jlelP(3 + cos Qu + cos 2(w + 2m /3) + cos 2(w + 4/3) 


3 "i 
= Sllul? ol? 


The conditions A+ B+C = 0 and min(|A — BI, |B ~C|,)C — A}) > 1 
imply that A? + B?+C? > 2 (without loss of generality, assume that A and B 
are nonnegative. As |A ~ B| > 1, we have max(A, B) > 1, so A? + B? +07 = 
2A + 2B? + 2AB > 2max(A, B)? > 2) and so 


fel? ol? = (2 +P + AN? ye + 2) > 5. 
it is easy to see from the proof above that equality is attained. Simply choose 
any (x,y,z) with «+ y+ 2 = 0 and choose (a,b,c) witha+b+c = 0, 
orthogonal to (x,y, z) so that A = 0, and scaled so that B = 1. For example, 
taking (2,y, z) = (1,~2,1) we see that (a,b,c) = (—1,0,1)/3 suffices. a 


We present three short solutions for the following problem. However, none 
of them is really easy or natural. 


20. Let a,b,c,d be real numbers such that 
(a? + 1)(b? + (2? +1) +1) = 16. 
Prove that. 


~3 < ab+be+ cd + da + ac+ bd — abed < 5. 


Titu Andreescu, Gabriel Dospinescu 
Proof. Write the inequality in the form 


(ab + be + ed + da + ae + bd — abed ~ 1)? < 16. 
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Next, apply Cauchy-Schwarz in the form 


(a(b + +d — bed) + be + bd + ed - 1)° 
< (a? +1) [(o+c+d ~ bed) + (be + bd + ed -1)*}. 


The miracle is that we actually have 
(b+c+d— bed)” + (be + bd + ed — 1)? = (b? +1)(c? + 1)(d? +1). 


Of course, this can be checked by brute force, but perhaps nicer is determining 
it through two applications of Lagrange’s identity. 


Proof. We will use Lagrange’s identity and Cauchy-Schwarz in the form 


[ft +0) = [(e +0)? + (1 — 08)} [(e + a)? + (1 ed)? 
2 [(a t+ b)(c+ d) ~ (1 — a6)(1 — ed)}?. 


Note that 


{a+ b)(e+ d) ~ (1 ~ ab)(1 — ed) 1+ ab + be + cd + da + ac + bd — abed. 


A moment of thought shows that what we have just proved is exactly the 
a 


desired inequality. O 


Proof. Here is a rather unnatural, but very elegant proof. Consider the poly- 
nomial 
P(X) = (X — a)(X — b)(X — c)(X ~ d) 


and observe that the hypothesis becomes P(i)P(~i) = 16. This can also be 
written as |P(i)|? = 16. On the other hand, we have 


P(i) =1— ab + abed +i (ST a— Sate). 


We deduce that 


(1- Shab + abea)® + (Sra~ abe)’ = 16, 


Hiadioluinnt messes saace 
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from where the conclusion follows, as the desired inequality can be written as 
|L~ So ab+ abed| < 4. 


Straight from the Book, isn’t it? mi] 


We continue with a really nice, but rather technical problem. It re- 
quires some delicate algebraic computations and a rather exotic application 
of Cauchy-Schwarz. We also present a more conceptual proof, due to Richard 
Stong, which uses more advanced tools, but proves much more. 


21. Let a,b,c, d,e be nonnegative real numbers such that a? +6°+c? = d?+e? 
and a4 + 64 4 ¢4 = d?+e4. Prove that a3 +6843 < +3. 


IMC 2006 


Proof. Since we only deal with even exponents in the hypothesis, let us square 
the desired inequality and write it in the form 


So a8 +257 a3s* < do + && + 20463, 


Next, the identity 


a® — 3a2b22 = 5 (Soe?) (8dc0 _ oS #)’) 
combined with the hypothesis easily yield 
a® +68 + 8 = 3a7b*c? + d®& +4 &%. 
Thus, we need to prove that 
257 ab? + 3076? < 2(de)® = 2(a7b? + Be? + ca?)¥?, 


With the obvious substitutions, this is equivalent to the inequality 


2 » x? + 3ayz <2 (se) . 
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Using Cauchy-Schwarz 


} 2 
(25028 + 3ayz)? = =(> x(2a? + y2)) 
it remains to prove that 
% o\2 
See? +y2z)? <4 (”) ; 


which follows immediately from 2?y? + y°2? + 227 > xyz(x + y+ 2), itself 
equivalent to S\(ry ~ xz)? > 0. D 


1A 


es i) oes + y2)’) 2 


Proof. Consider the polynomial p(t) = t8 — o1t? + oot — 03 with o; and ae 
fixed and a3 allowed to vary. Regard the roots «, y, z of p(t) as fimctions of og. 
Suppose f : R > R is any smooth function (at least three times differentiable 
for the discussion below). Define a function 


gos) = fle) + f(y) + fl). 
Then g is a differentiable function of 3 and 


yo f@ Fw) 
es) = Ga ye-a * G-DU— 


for some ¢ with min(z,y,z) < ¢ < max(z,y,z). To prove the first equality 
one merely checks that the curve 


s 2 = 
ds)=(*+ e-erytt wus)" aren) 


preserves o1, has ‘a! =x 0, and has Aga ar 1, For the second equality 
“ bss 
note that 


f(t) - oe -yt-2) _ fWMt-t-2)  f@t-at- 


ae y-2y—a) (z= 2)(z—y) 
vanishes at ¢ = «,y,z. Hence by two applications of Rolle’s Theorem, its 
second derivative vanishes at some ¢ in (min(z, y, 2), max(a,y,z)) and this is 
the required ¢. 
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Apply this to 2 = a”, y = b?, z = c*, and f(u) = u8/?. This amounts to 
choosing £,y,z to be the roots of p(t) = 3 — (d* + e?)t? + det ~ o9. Since 
f'"(¢) <0 for all ¢ > 0, we see that g is a decreasing function of 03 for 03 > 0. 
Thus g(0) > g(o3) which, unwinding definitions, is the desired inequality. 

This argument shows that the same inequality holds for any exponent 
strictly between 2 and 4 and gives similar (sometimes reversed) inequalities 
for other exponents, ima 


The reader will probably appreciate the beauty of the following inequality. 
It is more difficult than it appears at first sight, as the obvious application of 
Cauchy-Schwarz fails rather badly. 


22. Prove that for any real numbers 71, 22,...,2n the following inequality 
holds 


= 
non a ae a 
SY ia-al} < Ae) Pm 


isl j= 15 


|g af? 
1 


n 
IMO 2003 

Proof. The first step is to order the «;’s, say 21 < a <--+ <a. Then 

y |ag ~ zy) = 237 (ay j ~~ Ui) = UA -(n— 1a, — (n ~ 3)ng +--+ (R—Uan). 

igmk i<j 


Thus, applying Cauchy-Schwarz yields 


Seca) <4. [em Qi + |: zi. 
ag 


i=1 i=1 


It is easy to compute the last expression and the final estimate that we 


obtain is 
4anin’ 
Eb-ni) <i 
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On the other hand, Legendre’s identity shows that 


te 
s (xj -2;)? = mn oa —~2 (=) 4 
ijl 

Well, unfortunately, when we combine all this we see that we are not 
done, because of the bad term 2 (S77. vi). Fortunately, it is easy to repair 
the argument: indeed, we can always add the same number to all 2;’swithout 
changing the hypothesis or the conclusion of the problem. Thus, we may 
assume that 2] +22 +---+2, = 0. But then the previous inequalities allow 
us to conclude the proof. a 


The following problem is a very tricky application of the Cauchy-Schwarz 
inequality. The technique used in the proof is worth remembering, since it is 
quite useful. It is also a standard tool in analysis and probability (it is actually 
likely that the following problem is inspired by probability theory). 


23, Let @1,42,...,@, be positive real numbers which add up to 1. Let n; be 
the number of integers & such that 2)~? > a, > 27. Prove that 


s (is <4+ v/loge(n). 


il 
L. Leindier, Miklos Schweitzer Competition 


Proof. Choose a positive integer N and split the sum in two parts: the one 
for i < N and the one for i > N. Apply the Cauchy-Schwarz inequality for 
each of them to obtain 


On the other hand, we have 


Dee ss and Sons Soni =n, 


i>N i>N >t 
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the last relation being obvious by definition of the n,’s. Finally, and most 
importantly we have 


N 


VesEr (| YD )<hee 
kok 


ist at 2-4 <a, S2t-t 


Putting these inequalities together, we deduce that 


{nt jn 
> 5 < gy t VN. 
1 


Taking N = logs(n), we obtain an even stronger (and strict!) inequality, in 
which 4 is replaced by 1. im 


We end the series of moderately difficult problems with a very nice-looking 
improvement of an IMO 2004 problem. 


24. Let n > 2 be an integer. Find the greatest real number k with the 
following property: if the positive real numbers 21, 22,...,2y, satisfy 


i ; 
k > (ay +29 4+-+++2n) (f+eee+2), 
nae 


then any three of them are sides of a triangle. 


Adapted after IMO 2004 


Proof. The main point is to solve this problem for n = 3. If b,c are positive 
real numbers, let us look at the possible values of 


f(a) = f@+o+o(¢ +54 :) 


when « > 6+ c. It is not difficult to check (either directly or by computing 
the derivative) that f is increasing in this domain of x, so that 


f(a) > flo+e) =2b+e) (G+ i eter) =242hto" 5 > 10. 
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We deduce that if x, b,c satisfy f(z) < 10, then «, b, ¢ are the sides of a triangle 
(since everything is symmetric in z, b,c). Thus, for m = 3 the answer is 10. 

To reduce the general problem to the case n = 3, we use Cauchy-Schwarz, 
which allows us to obtain information about 1, 22,23 knowing that 


1 1 i 
k> (e, +224 stan) (2 t eet he i: 
Ee a 


Indeed, using Cauchy-Schwarz and the inequality 


na By 


(eat ty) (Ete 2) 2-3) 


we obtain 


f 
1 1 i 

Vk >n 8+ le tar+es)( ' i” 
I C3 


so that 


1 a Serf _ é 
(x1 + x2 +23) (3 to + +) (VE -—n +3). 
ay £2 £3 


Using this and the previous case (n = 3), we deduce that if 


ae 1 
(0-34 VIOP > (tate tm) (StS tee e), 


then any three among the numbers 2, 22,...,2, are the sides of a triangle. 
Tt remains to check that this is indeed optimal. To see this, choose 


TH Mp o_ el, AS TQ, Tyg mye. 
Then x1, %9,23 are not the sides of a triangle and 


dit i ; 
(ay + @2 +-++- +2) (2 me bt +) = (n- 34 V10)?. 
Zy 22 ei, 


Thus, the answer is ky, = (n ~ 3+ V10)’. 
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We give two proofs for the next difficult problem, The first is based on a 
very tricky application of Cauchy-Schwarz combined with a mixing-variables 
argument, while the second one is a pure, but very technical, mixing-variables 
argument. 


25. If a,b,e,d,e are real numbers such that a +b+c+d+e=0, then 


30 
7 


@+P4eP4 Peel < lattice di tel). 


Vasile Cartoaje 


Proof. First of all, we may assume that among a,b,c,d,e at least three of 
the numbers are nonnegative, say a,6,c. This follows immediately from the 
pigeonhole principle, possibly after changing the sign of all numbers. The key 
step is the following very tricky application of Cauchy-Schwarz: 


9(a* + bt + ch tat + 64) = [O(a4 + bt + c#) + 2(a? + e*)] + (7a) + (Ze*) 


en — 2 
44 hf 4) (04 a 4 , 
" (oy +H ch) + Ad + A) He) 


This reduces the problem to showing that 
aV/olat ++ A) + O(a + 4) > V2I(0? + P+ 2). 
To exploit the relationship between a, b,c,d,e, we use the fact that 


G4el> (d+e)* _ (a+b+o)4 
2g ag 
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Thus, it remains to prove that 
36(a4 + 64 +64) + (a t+b4+c0)! > 21a? +? +2)? 


for all nonnegative numbers a, 8, c. 
This inequality does not seem to follow easily from well-known results, so 
we will employ the powerful technique of mixing variables to prove it. Let 
F(a, b,c) = 36(a4 + bt +c4) + (a+ b+40)4 — 21(a? +B? +07)? 
= 15(a4 + b4 + ct) ~ 42(a76? + Pe? 4 ca?) 4+ (a tb +0)*. 


We will first show that for a = min(a, b,c} we have 


b+e b+e 
2 3 ‘ 


f(a,b,e) > f (74 —— 
For this we compute 


fla,b,e)-—f (252,252) = 15 (+e eee 


7 4 
~— 42a? (¥ + ~ Oxon) - 42 G as eee) 
= a0 ~ c)? (426? + 92be + 42c? ~ 56a"] > 0, 


where the final inequality follows since 286? + 28¢” — 56a? > 0. Thus we reduce 
to the case where a < b = c. In this case we compute 


f (a,b, b) = 4(b4 + 86° — 1567a? + 2ba? + 4a4) 
= 4(b — a)?(6* + 10ab + 4a") 
>0. 


The result follows. 
Note that we have equality for a = b= c= 2 and d=e=~-—8. 5 
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Proof. We will use a mixing variables argument. Let 
F(a,b,¢,d,e) = 30(a4 + 44 cha dt+el)-707? +P 4240? 4 2). 


We want to prove that for a+6+c+d+e = 0 we have F(a,b,c,d,e) > 0. 
The basic formula we need is that 


dt+e as) 


er) 
= (d—¢)*(2id? + 34de + 2le? — 7(a? +.B? +c), 


F(a,b,c,d,e} ~ F (« b,c, —— 


which can be checked by tedious computation. 

By the pigeonhole principle three of @,6,c,d,e must have the same sign 
(counting zero as having either sign), By symmetry, we may assume a,b,c > 0. 
Then 


Ta? +P +6) <atb+e) =7d+eP 
<17(d +e)? +. 4(e? + e?) = 2d? + B4de + Qe”. 


Therefore by the formula above F(a,b,c,d,e) > F (a,b,c, 42, 4£). Thus we 
may assume three of a, b,c, d,e are nonnegative and the other two are equal. To 
keep the same basic formula, we invoke symmetry and switch our assumption 
toc,d,e > 0 and a=6= —(¢+ d+ e)/2. Now suppose c < e, Then we have 


1(?+P4+e)= 5c +dte)* 477 


< id + 2e)? + 7e? = ad + 1d4de + 21e7 
< 21d? + 34de + 2te?. 


Therefore we again have F(a,b,c,dje) > F (a,b,c, S*, 442). We conclude 
that we can repeatedly average the largest of ¢, d,e with either of the other 
two and we will only lower F. By continuity of F, we therefore reduce to 
the case where a@ = b and c = d = e. In this case it is easy to check that 

F(a,b,c,d,e) = 0 and we are done. Q 
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We end this chapter with a challenging inequality, which combines some 
clever uses of Cauchy-Schwarz with a tricky homogeneity argument. This 
result also appears in [35] and it is a generalization of a problem discussed in 
[3], chapter 2, example 11. 


26. Prove that for any positive real numbers aj, d2,...,46n, 21, £2,..-;20n 
such that 
y aa~ (5), 
ici<jen 


the following inequality holds 


ay 
ag te +4 


(wat+--++an)4 
n 


Vasile Cartoaje, Gabriel Dospinescu 


Proof. First of all, it is enough to prove that for any 71,22,...,2, > 0 and 
any @1,42,...,4, > 0 we have 


ay 
————— (#9 ++--+2 > ip. 
b Dremel ye 2 


Since this is homogeneous, it is enough to prove it when 21-+ze+-+--+7,_, = 1. 
In this case, it is equivalent to 


a ayty 
> $ 
p Dncerecers 22500 9a : 


But Cauchy-Schwarz shows that 


2 
ay z 
Berger OM eres, a7 
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and a second application of Cauchy-Schwarz yields 


[+0 (Sten) (eaee Ea 


1Si<jsn 


2 2 
(= ele ) eS aL 
Ages + an n-1 Gy fs + ay 


A very elegant approach for this inequality was proposed by Darij Grinberg 
in [35], where a more general result is proved. We may assume that 


Qt ag ts thay = 1. 

We need to prove that 

<< iy n 

a G~a)Q—a;) ima) = In — 2 

Using T2’s lemma (a form of Cauchy-Schwarz), we reduce this to proving that 
2 


n 
SS aia; 2 m2 S- ajaj(1 ~ ai)(1 — a3). 


i<j i<j 


Note that 


i-~Yia? J 
SY aia; = BiH =3 So as(t ~ a4). 
4 


i<j 


Thus, if we let 6; = a;(1 — a;), it remains to prove that 


(by ee ee Ste 


i<g 
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This follows directly from Cauchy-Schwarz and the identity 


2S baby = (by + ba to + bn)? ~ BEB + + OF). 


i<j 
Remark 2.1. We can also prove the inequality 
2 fiaitsa) 2 ed 
i<j 
by mixing variables: consider the map 
F(@1,@2,...,@n) = 2y0 ees ear 


i<j 


and set = ape We claim that F(ai,a2,..-,@,) > P(x, 2,a3,-..,Gn). 
A small computation shows that this is equivalent to 


2azag 22? 7 
Want aaa 0. 
+G-ajG—m) G-a = 


Another computation shows that 


and 
2x? Qaran (a, ~ a2)" 7 1-2z 
(1-2)? (1—a1)(1— a2) 2 (L—2)*(1 ~ a1)(1 ~ a2) 
Thus, the inequality F(a,,42,...,@n) > F(x,2,a3,-.-,4n) is equivalent to 


2 


23 
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But this is easy, since the left-hand side is at least 237.3; = 2(1 — 2z) 
and 2(1 - 2z) > 4:28 is equivalent to (1 — 2x)? > 0. Thus, we have 
F(a1,02,.-.,4n) > F(a,2,03,...,@n). Continuing to mix variables in this 
way and using the continuity of F implies that F(a,,a9,... +@n) is at least: 
F(m,m,...,m), where m is the arithmetic mean of the a;’s, namely 1/n. The 
result, follows. 


2.1 Notes 


The following people provided solutions to the problems discussed in 
this chapter: Vo Quoc Ba Can (problem 20), Ta Minh Hoang (problem 17), 
Mitchell Lee (problem 6), Dung Tran Nam (problem 25), Dusan Sobot (prob- 
lems 1, 2, 5, 7), Richard Stong (problems 14, 19, 21, 25), Gjergji Zaimi (prob- 
lems 3, 4, 10, 13). 


Addendum 2.A Cauchy-Schwarz in 
Number Theory 


This addendum shows some very beautiful applications of the Cauchy- 
Schwarz inequality to number theory problems. We present Gallagher’s sieve 
and a beautiful arithmetic application; we discuss the large sieve, an amaz- 
ing tool invented by Linnik and extensively developed by a series of brilliant 
mathematicians; finally we discuss another famous result, the Turan-Kubilius 
inequality. The Cauchy-Schwarz inequality plays an important role in the 
proofs of all these theorems, which are elementary but quite powerful: this 
ought to show the reader how Cauchy-Schwarz appears in “real mathematics” 
and not only in olympiad-type problems. Analytic number theory has the rep- 
utation of being rather technical and this addendum is no exception. We hope 
that the results presented here (especially their applications) will compensate 
for this nonetheless. 

The following two sections try to give satisfactory answers to the following 
natural problem: suppose that A is a set of integers such that 


A (mod p)= {x {mod p)ic € A} 


is relatively small for all primes p in a finite set P. Are there nontrivial bounds 
on the size of A? 


2.4.1 Gallagher’s sieve and a nice application 


Recall that the von Mangoldt function A is defined by A(p) = logp if 
p ig a prime and n > 1 and A(x) = 0 for any other integer 2. The crucial 
property of A is that 


S- A(d) = logn 


din 


for all x. The following theorem is a pretty tricky application of the Cauchy- 
Schwarz inequality, but the application given below reveals its usefulness and 
power. 
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Theorem 2.A.1. (Gallagher’s larger sieve) Let S be a finite nonempty set of 
integers and let P be a finite set of prime powers. Assume that for each p © P 
we can find a real number u(p) > |{s (mod p)|s € S}| such that 


>= MP) ot0g x, 
Sop P) 
where X = maxgeg |s|. Then 
Dupe A(p) — log 2X 
per Ap) -log 2X 


Proof. Let p € P and let s(r,p) be the number of elements of $ that are 
congruent to r modulo p. Then by Cauchy-Schwarz and the fact that 


(p) 2 |{s (mod p)|s € S}| 
we have 


et a pol 
\sP = (= Hn) < ulp) 7 s(r,p)*, 


r=0 r=0 
thus 


is? 
=| 
aoe y l= |Si+ S 1. 
r=0 81,8269 $y F82ES 
313 ‘7 (mod p) pisi-s2 


Multiplying this by ie and summing over all p yields 


is? yo 42) aos siS}S°A@)+ SOS Aw 


peP peP $1482 playa 

As 
S= Alp) < log(|s1 — s2|) < log 2X, 
pisi~se 
we deduce that 
isi? > 2 < 15] DAG) + (/SP ~ isiploe2x, 

per * peP 

from which the result follows immediately. CQ 
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The promised application (taken from [19]} requires some preliminaries. 
The following result is very classical, being related to the following natural 
question: given two positive integers, what is the probability that they are 
relatively prime? 


Proposition 2.4.2, As x — 00, we have 


SY eh) J 52 +O(«lnz). 


Proof. The key point is the equality 
lk) _ yn oH 


k ae d 


which follows easily from the classical formula 
a 
o(k) =k-J] (-3). 
pik 


This relation yields 


ks ksa | dit 
ud) j 
Sa ae 
dex a4 
= >> a(d)- [3] @ + (8) 
dee 2 
x (d) z 
= Me +01 Soo 
2 dsa e dSa d 


Note that Soj<, 4 = O(w@lnz). Next, we claim that 


ad 6 


d>1 
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This follows from Euler's celebrated formula }°,,5., ed = . and the following 
computation! 


u(d) 1 pd) 1 a 
> ez Die * oe (nd)? aa Lua) =i 


dh n>1 kD) 
Finally, as . 
u(d} a ( 1 ) 
rel < Dag? 
oy = > 
d>e [  d>x dd 1) % 
we obtain the desired result by combining the previous observations. o 


Theorem 2.4.3. Let a,b > 1 be integers such that for any prime power p 
there exists k > 1 (depending on p) such that b = a* (mod p). Then b is an 
integral power of a. 


Proof. We may assume that a > 3 (as we may work with a? and 6? instead of 
a and 6). The most difficult step is to establish that Ina and Iné are linearly 
dependent over Q. Let us assume that this is not true and consider a large 
number z. Let S, be the set of numbers smaller than x and of the form a*-b', 
with 7,7 nonnegative integers. As Ina and Inb6 are linearly independent over 
Q, the set S$, has the same number of elements as the number of pairs (i, j) for 
which ilne + jb < Ina. So, there is an absolute constant ¢ > 0 depending 
only on a and 6 such that |S,| > c(Ina)?, We will bound |5,| from above 
using Gallagher’s sieve. 

For any positive integer y let P, be the set of prime powers dividing at 
least one of the numbers a—1,a*—~1,...,a¥— 1. For each p € P,, let u(p) be 
the order of a mod p. Then u(p) < y for all p € P,, and, since 6 is a power of 
a modulo p, we have |S, (mod p)| < u(p) for all p € P,. To continue, we need 
a technical lemma. 


Which uses the absolute convergence of the double series, as well as the fact that 
Doan H(d) equals 0 for all k > 1 and 1 for & = 1. 
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Lemma 2.A.4. There exist constants c,c2 > 0, depending only on a > 3, 
such that for all y > 1 we have 


ey? < SO Alp) < cay’, 
PEP, 


Proof. One estimate is very easy, since 


3 Aw) < s 32 At) gry ~< WWD Ina. 
j=l 


pePy jal dlai-1 
The other estimate is more delicate and crucially uses properties of cyclotomic 
polynomials? and proposition 2.A.2. Let ¢, be the nth cyclotomic polynomial. 
Remark 9.5 implies that 
S> A@)= SF Aw) - SDA) = Indala) — nd. 
ulp)=d pidala) pid 
By the very definition of dg we have In ¢q(2) > y(d) - In(a ~ 1). Hence 


37 Aw) = > ST Aw) = |S e@ | ine-)- Sona. 


pePy asy ulp)=4 d<y dy 


Since > Ind = O(ylny), it suffices to use proposition 2.A.2 to conclude. 0 
dsy 
The previous lemma shows that by choosing ¢ correctly and y about clnz, 


we can ensure that 


= A@) J 1 S7 At) > 2in(2e) 


and so by Gallagher’s sieve 


1 


Sz] S in(Qz) 


Ss A(p) ~ In(2x) | < eglng 
pEePy 


Ror more details the reader is referred to section 9.2. 
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for an absolute constant c3. This contradicts the first paragraph for large x. 
Hence Ina and In6 are linearly dependent over @ and so there exists an 
integer c > 1 and positive integers i, 7, relatively prime, such that @ = c' and 
b= c!. If p is a prime power divisor of c — 1, there is k, such that p divides 
? — 1 and so p divides & ~ 1. We deduce that c’ — 1 divides c — 1 and so 
é ; divides j. The result, follows. 


Remark 2.4.5. The result does not hold if we consider only primes instead of 
prime powers. For instance, using properties of quadratic residues (not more 
than the multiplicativity of Legendre’s symbol) one can easily prove that 16 
is an 8th power modulo any prime. Of course, 16 is not an eighth power of 
an integer. In the beautiful papers [4] and [32,9 the following general result 
is proved fully using techniques of algebraic number theory: 


Theorem 2.A.6, Let n > 1 be an integer and let a be an integer such that 
ais an nth power modulo any sufficiently large* prime. Then either a is the 
nth power of an integer or 8|n and a = 226" for some integer b. 


2.4.2 The large sieve 


This rather long section presents a very deep result in analytic number 
theory, known as the large sieve. Introduced by Y, Linnik and extensively 
developed by Renyi, Bombieri, Davenport, Montgomery, Vaughan, Gallagher 
(see [8], [9], (14], [25], [37], [44], [46], [57], [55], [56] to cite only a few refer- 
ences}, this became a basic tool in modern analytic number theory, with rather 
spectacular results. We start by presenting a vast generalization of a famous 
inequality of Hilbert, due to Montgomery and Vaughan, which, combined with 
some very clever tricks, yields the analytic form of the lange sieve inequality. 
Combined with standard results in finite Fourier analysis (for which the reader 
is invited to read addendum 7.A) this yields arithmetic forms of the large sieve. 
This has some amazing applications to the distribution of prime numbers, twin 


5Also see theorem 9.B.60. 
‘Actually, the proofs show that it is enough to assume that this holds for a set of primes 
of Dirichlet density 1. 
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primes, least quadratic residue, prime numbers in arithmetic progressions, etc. 
We follow rather closely the amazingly well-written article [56!. 


The analytic form of the large sieve inequality 


We will spend some time proving the following deep theorem, known as 
the analytic form of the large sieve inequality. Let {lal} = minne |e ~ | be 
the distance from the real number ¢ to the discrete set Z. 


Theorem 2.A.7. Let x1,2,.--,2n be real numbers such that 
haa -— z;\| ze>rd 


for alli # j. Let 
T(z) = > py BRE 


McksM+N 
be a trigonometric polynomial, where M,N €& N and zyqiy.-.s2M4Nn © C. 
Then 
n 1 . 
Siege s(N+z) So bal 
j=l M<k<M+N 


Theorem 2.4.7 has a long history and many mathematicians contributed 
to it: Davenport-Halberstam, Gallagher (who gave a very simple proof of the 
inequality with 7N +4 instead of N + 4), Montgomery and Vaughan, Selberg. 
One can prove (this is due to Selberg) that the factor N + i can be improved 
toN-1+ 2 and that this is sharp. The proof of theorem 2.A.7 will span over 
the next sections. 


Tools from linear algebra 

In this section we recall a few standard facts about inequalities concerning 
matrix norms and we prove a duality principle. Recall that the standard her- 
mitian product on C* is defined by (ay) = Dey Tie, Where 2; (respectively 
ys) are the coordinates of x (respectively y). Ifv € C®, we denote luj? = (v, 0). 
Definition 2.4.8. A matrix A = (aj) € Mp(C) is called hermitian if for ail 
x,y € CT we have (Az, y) = (x, Ay). 
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The reader can easily check that A = (aij) € M,(C) is hermitian if and 
only if aj; = Gj for all i,j. A fundamental result of linear algebra is that for 
any such matrix A we can find real numbers \;, Ag,-.-, An, and an orthonormal 
basis v1, v2,...,Un of C® (ie. (vj,u;) = 0 if i # 7 and 1 otherwise) such that 
Av; = Ait; for all i. This can also be stated as: all eigenvalues of a hermitian 
matrix are real numbers and there exists an orthonormal basis consisting of 
eigenvectors.” The following result will be very useful in the next sections. 


Proposition 2.A.9. Let A be a hermitian matrix and let C > 0. If the 
inequality |(Av,v)| < Clu]? holds for any eigenvector. v of A, then it holds for 
anyve Cr. 


Proof. Ui, U2,.+-,Un be an orthonormal basis of eigenvectors, with cor- 
responding eige’ aS si E " i 

sp a g eigenvalues Ay,A2,..-,An- Consider any v € €” and write 
uv = S01, tiv for some a; € C. Then 


a 
(Av, v) = 37 aeR5Ac- (vi, 07) = 7 da |i? 
ag t=1 


By hypothesis we have |(Av;,v:)| < C]v;|? for all i, and since Av; = A; -;, we 
must have jA;| < C. Hence i 


nm 
(Av, v)} < CS" las? = Chel? 
1 
; We end this section with a very useful technique. Though elementary, it 
will play a key role in the proof of theorem 2.A.7. 


Proposition 2.4.10. (Duality principle) Let (aij)i<i<masicn be complex 
numbers and let C > 0. The following are equivalent: a. 


1} For all = € C we have 
n 


™m 2 Mm 
2 [ows] SCD lal 
i=} =i 


= (aij) € M,(C) is any matrix and if A ¢ C and v € C” i 
a Mn(C) €C" — {0} satish 
Av = \-v, then we say that v is an eigenvector of A associated to the ae H 
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2) For all y, € C we have 


me i nm nr 
ayy] < OV tw? 
| 


i=l [jel =] 


Proof. Assume for instance that 1) holds, Then for all z,y; € C we have, by 
Cauchy-Schwarz 


m nm 2 n 2 
SE ajziys| < Soy? SOS aig2i 
a z i 


i=l j=l j=l 


soy lad? So ly! 
3 3 


By choosing for z; the complex conjugate of > 7 @ij¥j, we obtain the inequality 
in 2). The converse is proved in exactly the same way. a 


Montgomery and Vaughan’s theorem 


The key technical ingredient in the proof of theorem 2.A.7 is the following 
delicate result [57]. 


Theorem 2.4.11. (Montgomery and Vaughan) Let 2, 22,...,0 be real 
numbers such that ||x; — 2;|| > ¢ > 0 for alli Aj. 
Then for all z,22,...,2n EC 


The proof of theorem 2.4.11 is a very nice mixture of elementary, analytic 
and algebraic arguments. A first crucial ingredient is Euler’s famous identity 
(that we will take for granted) 


c ae 
= lim Pa 
singz N18 
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Since 


x 
=e 7) 


k=l 


(the last equality is immediate, since aie ~> 0 as n - 00), we can also 


write i 
ee \ALY (=1) 
ain me: = Ne om (- Wr) atk’ 


Thus, it is enough to prove that for all N > n we have 


-<v \k ~i* rT 
a5 yO) aaoa| Set 


i4j IkISN 


Now, since there are N — |k| solutions of the equation j) ~- jo = k with jy. Jo € 
{1,2,...,.N}, it is not difficult to see that the inequality is equivalent to 


3 5 — | =n 


— lal. 
~ th ~ jal é lal 


i Fig ISAgesn ™ 


This follows from the following vast generalization of a famous inequality due 
to Hilbert, applied to the family of real numbers (a; + j)1<i<n, Lyn and to 
the family of complex numbers ((—1) zi i<i<n, I<jene 


Theorem 2.4.12, (Montgomery-Vaughan) Let e>0 and let x21, 22,...,4,€R 
be such that |2j ~ 2j| > € for alli # j. Then for any complex numbers 
21s 2250-05 2m 


iA 


Proof. By homogeneity and symmetry we may assume that ¢ = 1 and that 
£1) < £2 <+--<a_. Then the hypothesis implies that #j;~2; > j~ifori <j. 
Consider the matrix A, where ay = ligg aay 7 . Note that A is antisymmetric, 
thus i- A is a hermitian matrix, to which proposition 2.A.9 can be applied with 
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C= 7. Thus, we may assume that z= (z1,..., 2n) is an eigenvector of iA, so 
also of A, say Az =i- Az for some 4 € R. 
By Cauchy-Schwarz, the square of the left-hand side of the desired in- 


equality is bounded by ST, jz? - 3); |Dojgs az , $0 it is enough to prove the 


inequality rt 


Expanding brutally the left-hand side and using the crucial observation that 
aizGix = Ojn(4j ~ dix) yields 


LHS = $7 22x > aijaie 
jk 


hj k 
— ss be,|?- Sai; + So Bena Say - Soe + Qajn 
i a7 ape a7 ipk 
It is now time to use the fact that z is an eigenvector: 


SHaen (To) Ts [Dawe] [Tov 


jzb es 3 Aj wi 


= ay? lz3\7 SS a 
j 


tAg 


Doing the same with the other sum, we finally obtain that the term 
- ~ 
do aeeaye | Las 
j#k if 
cancels with the other similar term, so that 


LHS = $7 fajl?- $7 a2; + $0 2a zn 0}. 
g 


ar fk 
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Now, using the AM-GM inequality |22jz;| < |z,|? + |za[* and rearranging 


terms, we obtain 
LHS < 37 [z)|?- So a3. 
j iff 
As |x; — 2;| > |i - j|, we have for any fixed j that® 


oe 1 2. 
ED er FL ee 


ifi ij nel 


C 


Jombining the last two inequalities yields the desired result. 


Proof of theorem 2.A.7 


Let ay; = e%***) for 1 <j <nand M<k<M +N. We need to prove 
that for all (z,)arcecm4n we have 


| 
a | 
Sy aa 
| 


j=l |M<kSM4N 


2 
i — 
< (v+ :) dele? 


and by the duality principle (proposition 2.4.10) it is enough to prove that 
for all 41,.--, 4m we have 


? 


: 
asus 


M<kSM+N [f= 


1 
<(N+- ly,l?. 
<(v+2) Thu 
j 
Expanding the left-hand side, we obtain the equivalent inequality 


ute Ment) < LT yi 


hFia MckSM+N : 9 


6 ahs >, ‘ 
"We use here Euler’s famous ide: 
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Using the formula for the sum of a geometric series, a small computation shows 
that for all u 


Ss etinku 1 ( eit U2Mt2N+1) _ eee, . 


M<k<oM+N 2 sin(x) 


By the triangle inequality, it is thus enough to prove that for any 
s€{2M+2N+1,2M +1} 


and for any yi,.--.¥n we have 


Tela crteenel cE Diy 
ib sin t(j, ~ Lj_) e& 


But this follows from theorem 2.4.11 with zj = y;-¢***7. 


A quick proof of a weaker form of the sieve inequality 


In this section we present Gallagher's short and beautiful proof of the 
following weaker form of the large sieve inequality. It is much simpler than 
the proof presented in the previous sections and the result it establishes is 
good enough in most applications of the large sieve. 


Theorem 2.A.13. Let 2,22,.-.,%n be real numbers such that 
la; -aj|| ze >0 


for allix j. Let ; 
T(z) = s Zhe evinke 


M<k<M+N 


be a trigonometric polynomial, where M,N & N and zyiiy-..,zman € C. 


Then Z 
1 
P< (ant Be 
dre) <(nv+2) tal 


M<kgM+N 
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Note that the only difference between this theorem and theorem 2.4.7 is 
the factor +N, which is N in theorem 2.A.7. 


Proof. Assume that f is a continuously differentiable complex-valued map on 
R. Integrating by parts, it is easy to establish the equality 


pa -[" 


% f(t)dt + i (t =a; + 5) f'(tat 


oy i" 3 


+ ee (t-2)-5) rwat. 


Using the triangle inequality and the fact that 


for t € [xj ~ §,x,] (and a similar inequality for ¢ € [3,2 + §]), we obtain 


1 tgkG wit 
tents =f" wold | 1g Ola 


Take f(a) = T(x)? and add the corresponding inequalities. The hypothesis 
|e: —2,|] 2 € ensures that the intervals (2; ~ §, 2; + §) do not overlap mod 
1, so using the 1-periodicity of f we obtain 


2, lade <: [ [T(o) Pde + if " (e))- [e"e)|ae. 


Using Parseval's equality and the Cauchy-Schwarz inequality, we obtain 


n M+N M+N MeN 


Siren ss So tale) So bee) So fankeal? 


jah k=M4+1 k= M4+1 koe M4 


1 M+N 
S (+27 max lex? 
G es) ye iax|". 
koM+1 
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So we are done if we can ensure that maxyecpcatin [k] < x. Of course, for 
arbitrary M and N it is unreasonable to hope for such an inequality, but a 
moment of thought shows that M plays no role in the theorem we are trying 
to prove, so we can simply choose M = — (Ae #1) to finish the proof. Q 


Arithmetic forms of the large sieve inequality 


We will apply theorem 2.A.7 for a special family of well-spaced numbers 
z;: pick an integer Q > 1 and consider all numbers of the form ? with 
gcd(a,q) = land i < @ <q < Q. It is clear that the difference between 
two such numbers is (in absolute value) at least 7. Applying the large sieve 


inequality to this collection, we deduce the following 


Theorem 2.4.14. Let 
T(x) ~ s ayerithe 


M<kSM+N 

be a trigonometric polynomial. Then for all Q > 1 we have 
a 

ot 


2 
r(¢)| <(N+Q%) SD jaw? 


M<kSM+N 


Next, let us observe that 


gq 
a dixko a, Binak 
T(2) = rae 4 y a, jer . 
q ik 


hl \k=h (mod q) 


‘Therefore, using the techniques of addendum 7.A, more precisely Plancherel’s 
formula (theorem 7.4.5, but this can also be done by expanding everything), 
we obtain “ 


q 2 g 
vr(G)f-9h] Ow 
a=} h=1 kh (mod q) 
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Now, if the a, were aie! distributed, one would expect that 


Lech (mod q) % behaves as Ey = } 37, ax. Actually, a small computation 
reveals that 
2 


2 
q q 
S} Daa -¥] 5 of -onr, 
{&=h (mod g) hzz] [koh (mod g) 


which combined with the previous formula yields 


2 
Unfortunately, }777; iff (:) 


yh 


iseS¢,ged(a,g)=1 


but if ¢ is a prime number, they are actually equal! Using these remarks and 
the previous theorem, we obtain the following strong inequality: 


Theorem 2.4.15. Let (ax) a<kem+n be @ sequence of complex numbers. 
Then for all integers Q > 1 we have 


Pp 
Sop: py > oat 5a <(N+Q) ) Sa lanl? 


PSQ heel |k=h (0d p) 
the sum being taken over all primes p < Q. 


By specializing a, = 1,e4 for a subset AC [M+1,M +N}, we obtain 
the following equidistribution result: 


Corollary 2.4.16. Let Ac [M+ 1,M +N] be a set of integers. Then for 
all integers Q > 1 we have 


Eye 


PSQ hel 


N(h-+ pZ)| — 4alp S (N+ Q*)AL. 
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This finally yields a “sieve inequality.” 


Theorem 2,A.17, Let AC ([M-+1,M +N] be a set of integers and suppose 
that for each prime p < Q at least w(p) residue classes mod p contain no 
element of A. Then 


Proof. Use the previous corollary and the observation that for each p we have 


Pp 


2s 


hel 


Al? 
ty wip) = 


2 
as at least w(p) terms in the sum are equal to ie 


The previous theorem helps understanding the name “large sieve.” In- 
deed, in typical applications A (mod p) will miss a good part of the residue 
classes modulo p (i.e. the integers w(p) will be quite large) and the sieve 
inequality will yield nontrivial bounds on [A|. The next result, an amazing 
theorem of Linnik, is at the very origin of the large sieve and one of its best 
applications. It concerns the distribution of the least quadratic non residue 
modulo p. Vinogradov conjectured that it is smaller than c(<)p* for any ¢ > 0 
and if one assumes the Generalized Riemann Hypothesis, then one can actu- 
ally prove that it is smaller than c(logp)?. The following deep theorem due to 
Linnik [47] is a first step in the direction of Vinogradov’s conjecture (which is 
still largely open): 


Theorem 2.A.18. (Linnik) Let n(p) be the least positive integer which is not 
a quadratic residue mod p. Then for all e > 0 there is a constant c(e) such 
that for ali N we have 


l{p < Nin(p) > N°}| < efe). 


Proof. Fix ¢ > 0, which for simplicity (but without loss of generality) we take 
of the form 3 for some integer d greater than 2. For a positive integer N, let 


Py = {p< VNin(p) > N*} 
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and 
Ay={newi(2) =1 for all pe Py}. 


We will prove the following technical result: 


Lemma 2.4.19. There exist positive constants c{e) and c(e) such that for 
all N > ex(€) we have |An| > cle)N- 


Let us assume for a moment that this holds. Since Ay (mod p) misses all 
quadratic non-residues mod By we can choose w(p) = Loa > Pi in the previous 
theorem and obtain [Ay] < raat }; Hence forall N > c @) we ihe iPyl < ay: 
which is enough to conclude the theorem. 

Let us prove the lemma. Note that Ay contains all numbers n < N all 
of whose prime factors are smaller than N*, as Legendre’s symbol is mul- 
tiplicative. We, will consider all numbers n = kp,-:-pq smaller than N, 


with p; € [N®~Z, N®] in nondecreasing order and some integer k. Note we 


koN ae ,asn<N, so k is relatively prime to any prime greater than N*~ 
This easily implies that all such numbers are different and they clearly ae 


to Ay. Since there are more than aon — 1 possible values for k, it follows 
that we have at least 
d 
N 1 1 
2aN -| ~a(n‘y? 
So atgcnwtedw( xt) ro 
me[Nne SNe} peENe—*F Ne] 


such numbers. Taking into account the fact? that a(x) = o(x) and 
logb 

ys = log me +0(1) 

velae] 


as a,b — oo, the last quantity is easily seen to be greater than e(¢)N for 
some positive constant c(<) and all sufficiently large (depending on ¢) N. The 
desired result follows. 


Both these results are proved in addendum 3.A. 
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Though theorem 2.4.17 is already a very strong result, in most applica- 
tions one needs more refined estimates, in which one takes into account all 
q <Q, not only prime numbers. In order to do this, we need another nice ap- 
plication of finite Fourier analysis and Cauchy-Schwarz, but before doing that 
recall that 4 is Mdbius’s function, defined by u(n) = (—1)* if n is a product 
of k > 0 distinct prime numbers and p(n) = 0 otherwise. 


Theorem 2.A.20. Let AC [M+1,M +N] be a set of integers and suppose 
that for each prime p < Q we have 


{a (mod p)ja € A}| < p—w(p). 


Then for all trigonometric polynomials T(z) = Dea axe™™ and all positive 


integers q 
Py rn (i > wo TSS eal Ircoyr 


ged(a,g)=1 * 
asq 


Proof. Let 

A = na? T[— ey, 

pig 

a multiplicative function. We will prove the theorem in two steps: first we 
will prove it when g is prime and then we will show that if the theorem holds 
for two relatively prime numbers q, q’, then it also holds for gq’. First, assume 
that g is prime. Let 

th= S ak. 

ke Akh (mod q) 

Then at least w(q) of the numbers x, are zero (by hypothesis), so by Cauchy- 
Schwarz and Plancherel’s identity we get 


: <(q- w) lea? = (1 ze “a > _ (3) . 


a=) 


from where the result follows easily. Next, assume that the result holds for ¢ 
and q', with gcd(q, q’) = 1. Using the Chinese Remainder Theorem and the 
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fact that the theorem holds for g (with T (3 + 2) instead of T(x)) and then 


for q’, we have 
“(ayh 
ag 


Py : y Geal 
ae(Z/qq'Z)* sets wih 
b (2: 
> > a|r(5) 


be(Z/q'Z)" 
2 gold )|TO)? = g(qq’)IT(O)|?. 


o 


Combining the special case T(x) = S7,¢4 €7"** of the previous theorem 
with the large sieve inequality (theorem 2.4.14), we obtain the following strong 
form of theorem 2.4.17: 


Theorem 2.A.21. (Montgomery) Let AC [M+1, M+N)] be a set of integers 
and suppose that for each prime p < Q we have 


lta (mod p)ja € A}] < p—u(p). 
Then |A| < Nag? where 


Fas 2 w(p) 
dw I— Petes 


Some applications 


We present here two classical applications of the large sieve: a uniform 
upper bound on a(m +) —{n) and an upper bound for the number of twin 
primes smaller than n. 


Theorem 2.4.22. Ifm > ./n, then there are at most a primes between 
m+landm+n. 
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Proof. Let A be the set of prime numbers between m+ 1 and m+n. No 
element of A is divisible by some prime smaller than or equal to [,/nj, so by 
theorem 2.A.21 we obtain 


|Al < mee b=) wo? T=. 


vn pla 


j : 
Using the fact that so = Yyo1 (3) , we obtain 


i 
ye s > log Vn = 3 losn. 
iva? 


Inserting this result in the previous inequality yields the desired result. 0 


Theorem 2.4.23. There exists an absolute constant c such that for all n 


there are at most Wan? primes p <n such that p +2 is also prime. 


Proof. Let f(n) be the number of twin primes smaller than n and let A be 
the set of twin primes in ([Yn],n]. If p € (2, [nl] is a prime number, then 
clearly 0,-2 ¢ {@ (mod p)ja € A}, so we can take w(p) = 2 for such primes 
in theorem 2.4.21 (and w(2) = 0), deducing that 


2 
fa) vm) =<, b= SD wa? TT 525. 


svn vid 
ged{2,g)=1 


i 
: : 3 2 ‘ : 
It remains to obtain a lower bound on £. Since oo = Dyer (2) ; we obtain 


laa, ipt+Upe- (ig $1 
pe ee ee ae 


ny ts = pe wee te 
¥ ; . . 1 Ds 
pi nplt Svat pil oplt <a 


Py Ps’ 


min pi>2 min py>2 
2 
= Ss a > a i > e(logn)’, 
kya ken 


god(2,k)=1 ged(2,h)=1 
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where d(k) is the number of divisors of k. 
Combining the previous estimates, we obtain 


= cone 
fin) — Fv) < Ge 
for some constant c, > 0. Using the trivial bound f([/i]) < Yn = O ( crx) 
the result. follows. C] 


Using the previous theorem, we can easily prove the following famous 
result, which is probably the origin of all modern sieve theories: 


Theorem 2.4.24. (V.Brun) The sum of the inverses of all twin prime num- 
bers converges. 


Proof. By the previous theorem, there are at most ey twin primes between 
2) and 2/1, for some absolute constant c. The sum of the inverses of these 
a primes is smaller than 3 - oe = a As Dist Fa converges, the result 
follows. 


These are far from being the most spectacular applications of the large 
sieve (though they are already deep theorems!). We refer the reader to 9], 
{19], [37], [44], for many other applications. 


2.A.3 The Turdn-Kubilius inequality 


In this section, we use again the Cauchy-Schwarz inequality to prove a 
famous inequality in analytic number theory, the Turdn-Kubilius inequality, 
We then apply it to establish a well-known result of Hardy and Ramanujan 
(which was actually the origin of this inequality) and then a very amusing 
consequence due to Erdés. We also present a dual form of the Turén-Kubilius 
inequality, due to Elliott, which has a similar form to the inequality established 
in theorem 2.4.15 and which turned out to have some pretty far-reaching 
consequences (for instance, a proof of the prime number theorem!). 
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A function f : {1,2,...} + C is called additive if f(ab) = f(a) + f(b) 
for all relatively prime positive integers a,b. It is called strongly additive if 
moreover f(p*) = f(p) for all primes p and all positive integers k. Thus 


fl(a)= ys f(p"™), respectively f(n) = s f(p) 


pin pin 


when f is additive, respectively strongly additive. In particular, if f is additive 


we have ‘ 
1Sa0-25 ([a]-[pal) 


phen 


while if f is strongly additive, then 


ty 0=2> [2 =) 40). 


imk psn 
Thus 
ke 
Ey(n) = > (: ~ *) fp ) respectively EY*(n) = s J(p) 
re p) pi —~ p 
pE<n pen 
is a good approximation of the average value of f(1),..., f(n) if f is additive, 


respectively strongly additive. Our fundamental result will allow us to see 


By(n) = St tee respectively By"(n) = oy: (fe)? 


pi<n psn P 


as an upper bound for the variance of f as a random variable on {1,2,...,n} 
(with the uniform distribution). Without further ado, we follow [24], which 
contains a very easy proof of this theorem. 


Theorem 2.4.25, (Turdn-Kubilius inequality) There exists an absolute con- 
stant C > Q with the following property: 
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1) For all strongly additive maps f : {1,2,...} + C and alin 
L nm 
= LF) - Bf (n)P < C- BY(n). 
k=l 
2) For all additive maps f : {1,2,...} 3 € and alin 
1 mh 
: So F(e) ~ Ep(n)P < C+ Ben). 
k=l 


Proof. We will prove only 1), as 2) is proved in exactly the same way. Suppose 


first that f takes nonnegative values. Denote E = Ef (n), B=, = ,/By*(n) and 
observe that 


S= Sir - -EP = = So ek)? - 2B Fe k) + nB?, 


kon kon 


As f is strongly additive, we have 


= TY soro= 5 [FIs ore farm so 


and the last quantity does not exceed nB? + nE, as f takes nonnegative 


values and {2} < x for all x. On the other hand, since [2] > x ~ 1 for all x, we 
can write 


~28 S* s(b) = ~28 > |] sp) < ~ane? + 28S Sep 


kn psn pen 
All in all, we obtain 


SS nB?+2ES° f(p). 
pan 


Next, two applications of ae show that 


E40) s [Sta [Pp B= Bw. aoe aE 


psn a, pen psn pen? 
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Using the classical estimates® 


1 n® 
= = O(log = 
po (loglogn), S"p o( 5): 


psn pen 


S<nB? (1+0/ Ee) 
log nr 


and the result follows. 
Suppose now that f is real-valued and define two strongly additive func- 
tions 91,92 by 


we deduce that 


(P) = 1pqy>r0°F(P),  92() = ~Lyyco f(p)- 
Using Cauchy-Schwarz, we get (with EB = Ey(n), Ej = Ey,(n)) 
2 on 
SLR) — BP s 2573 lash) - BP 
ken jel k=l 


and the result follows easily by applying the result of the previous paragraph 
to 91, 92. Finally, if f takes on complex values, set g; = Re(/) and g2 = Im(f 
and use the same argument. 


Corollary 2.4.26. (Turdn) If w(n), Q(n) is the number of prime factors of 
n without (respectively with) multiplicity, then 


See) — loglogn)? = O(n log log n) 
kn 


and 


$7 (2() —loglogn)? = O(n log log n). 
ken 


8Which follow easily from the results of the addendum 3.A. 
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Proof. Use the Turén-Kubilius inequality with f = w, respectively f = 2. 
Note that if f = w, then 


1 
E#(n) = BY (n) = a = loglogn + O(1), 
pan 


as follows from Mertens’ theorem 3.A.5. 
Similar arguments apply for f = 2. | 


The following theorem is an immediate consequence of the previous result. 
Its original proof was much more intricate (see [39]). 


Theorem 2.A.27. (Hardy-Ramanujan) 
The functions w and 2 have normal order loglogn, i.e. if f € {w, Q}, 
then for alle > 0 we have 


{nsan-e< AM. <ite} 


log log z = 


LE 
lim — 
£0 TF 


Proof. Let f € {w,Q} and let 


Ay = {n <a aes | > e} ; 


log log x ~ 
Since for any n € Az we have 
(f(n) ~ loglogx)? 2 ¢? - (log log x)’, 
we deduce that 


2. (logloge)[Ael <)> (f(n) — loglogx)? < }°(F(n) — logloga)?. 


néAz nsx 


Using the previous corollary, the result, follows. o 


Here is a very beautiful consequence of the Hardy-Ramanujan theorem, 
which is surprisingly difficult to prove by other means: 
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Theorem 2.4.28. (Erdés) We have 


. <a.b< 
tim Ke-biisabso}l 9 


n00 ne 


Proof. By the previous theorem, there are n? + o(n?) pairs (a,b) with 1 < 
a,b <n and such that (ab) is about 2loglogn. The same theorem shows 
that n? + o(n?) numbers k € {1,2,...,n?} have 2(k)} about 

log log n? = loglogn + log 2 << 2loglogn, 


r 


so the number of numbers of the form ab (with 1 < a,b < m) must be o(n”), 0 


The following difficult theorem (see [28]) refines considerably Hardy- 
Ramanujan's result. It is also known as the fundamental theorem of prob- 
abilistic number theory. 


Theorem 2.A.29. (Erdés-Kac) Let f : {1,2,...} + R be a strongly additive 
function such that the sequence (f(p))p is bounded (p runs over the prime 
numbers) and such that limp.roo Bs(n) = 00. Then for all real numbers a 


lim <H{n < alf(n) — Es(x) < ay/ By(x)}| = ot ee 


Let us end this section and this addendum with another beautiful appli- 
cation of the Turdu-Kubilius inequality, due to Elliott. We let up(n) denote 
the exponent of the prime number p in the factorization of n. 


Theorem 2.A.30. There exists an absolute constant C > 0 such that: 


1) For allx and alla, €C 


2 


s pt De lin z ( ‘a ‘) So an <Cz)> lan|?. 


pia nic ASE nse 
wp(nj=k 
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2) For all a and all complex numbers an 
2 


Sp Son - 2 om < CaS an). 


pSe inka nse nsa 
pin 


Before looking at the proof, the reader is strongly advised to compare this 
theorem and theorem 2.4.15, 


Proof. We will prove only the first part, as the same argument yields the 
second part. Observe that Turén-Kubilius’s inequality can also be written in 
the form 

2 


iss S> fo") (rsa) ~ ( ~ :) x) <cy Fo? 


k=l |[pY<n pren P 


This suggests defining for j,k € [1,n) the quantity 


i\l : 
Og = (toy = (: = 3) ) Vi 


if j = p“ for some u and some prime p and ajz = 0 otherwise. Given any 
sequence ¥,...,Y¥, of complex numbers, we can certainly define an additive 


previous inequality can be written 


2 
nin 


n 
SoS easy) < eC SF type? < ne 7 lyy 


Kerk | je psn j=l 


Applying the duality principle 2.4.10 and unwinding definitions, we obtain 
the desired inequality. (8; 


The previous theorem plays a key role in Hildebrand’s “elementary” proof 
(see [40]) of an extremely difficult theorem of Wirsing [85], [86], that confirmed 
a long-standing conjecture of Erdés and Wintner. 
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Theorem 2.4.31, (Wirsing) Let f : {1,2,...} + [-1,1] be a multiplicative 
function (i.e. f(ab) = fla) f(b) whenever ged(a,b) = 1). Then 


evists. Moreover, M(f) =0 if 


1-f) 
LE 


To see how subtle this theorem is, take Mébius’ function for f: it is fairly 
elementary that )°,, 7 = 00, so the previous theorem yields 57,,<, (nm) = o(z). 
But it is elementary to prove that this is equivalent to the prime number 
theorem! Even though much easier than Wirsing’s original proof, Hildebrand’s 
proof is still rather technical and we refer the reader to his article {40} for 
details. 


Chapter 3 


Look at the Exponent 


3.1 Introduction 


If p is a prime, we define the p-adic valuation map Uy: Z + NU {oo} by: 
Up(0) = oo and for n # 0 we have vp(n) = k if p* divides n, but p*+? does 
not divide n. More concretely, for n 4 0, Up(n) is the exponent of the prime 
number p in the prime factorization of n, The unique factorization of integers 
into powers of prime numbers easily yields 


up(ab) = up(a) + ¥p(b),  up(a +b) > min(vp(a), vp(6)), 


with equality if vp(a) # vp(b). The first property allows us to extend this map 
Up to Q by vp ($) = vp(a) — vp(b) for all nonzero integers a,b. It is an easy 
exercise to check that this is well-defined (i.e. if ¢ = §, then we get the same 
value for vp ($) and vp ($)). 


We will frequently use the following easy properties of the p-adic valuation: 


vp(gcd(x1,22,...,%n)) = win vp(zi), 


vp(lem(a1, 22,..-,2n)) = ee vp(ai). 
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3.2 Local-global principle 


A very useful idea when dealing with divisibilities (or even equalities con- 
cerning arithmetic objects) is the local-global principle: if a and 6 are nonzero 
integers, then a divides 6 if and only if for all primes p we have up(a) < vp(b). 
This “local-global principle” is the simplest. of a series of such statements, 
concerning the way in which the arithmetic of integers is governed by the “be- 
havior at each prime.” Some of these statements are very deep and most of 
them are an active area of research. Here are some other such examples: a pos- 
itive integer is a perfect square if and only if it is a perfect square mod p for all 
primes p (this is already a quite serious result, using the quadratic reciprocity 
law). Another famous example is Hasse-Minkowski’s local-global principle: if 
a; are nonzero rational numbers, the equation ay2? + agx3 + -+- + anz2 = 0 
has nontrivial rational solutions if and only if it has nontrivial real solutions 
and nontrivial p-adic solutions for all primes p. This is a deep theorem and we 
refer the reader to the addendum concerning p-adic numbers for more details. 

We start with some easy applications of the local-global principle. 


i. Prove the identity 


____lem(abo)? __, __sed(a bye)? 
lem(a, 6) lem(6,c) -Iem(e,a) ~_ged(a, b) - ged(b, c) - ged(c, a) 


for all positive integers a, b,c. 


USAMO 1972 


Proof. Fix any prime p and let x = vp(a),y = up(b) and z = vp(c). The p-adic 
valuation of the left-hand side is 


2max(z, y,z) — max(z, y) ~ max(y, z) ~ max(z,2), 
while that of the right-hand side is 
2min(z, y, 2) — min(x,y) ~ min{y, z) ~ min(z, x). 


We claim that these two quantities are equal. Since the two.expressions are 
symmetric in x,y,z, we may very well assume that « > y > z. Then we need 
to prove that 22 -2—y—a =2z-—y~z-~ z, which is clear. 
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Remark 3.1, Another natural proof uses the identity 


wy 
lem(z, y) = — 
gcd(x, y) 
and its analogue 


zy ged(z, y, 2) 


lem(z, y, z) = “ ; 
(1,2) ged(a, y) ged(y, z) ged(z, x) 


Plugging in these values immediately yields the result. 


2. Let a1, a2,...,a% and by, b2,...,, be positive integers such that a; and 
b; are relatively prime for all 7 € {1,2,..., k}. Prove that 
aym am apm 
ged (4, “hb ar “um ) = ged(ay,a2,...,ax), 
where m = lem(bi, be, ..., bg). 


IMO 1974 Shortlist 


Proof. Fix & prime p and let a; = vp(a;) and yj = Up(b). By hypothesis, we 
have min(x;,%) = 0 for all i and we need to prove that if 


z= max(y1,¥2,.--5 Ye), 


then 


min(x) ~ yy + 2,02 — yo + 2,...,4% —Yyet+2) = min(a@},£2,...,2%). 


Note that 2;—y, +2 > aj for all i, so certainly the left-hand side of the previous 
inequality is at least the right-hand side. On the other hand, if z = 0, then this 
forces all y; to vanish and in this case the equality is clear. So, we may assume 
that there is some j such that y; = z > 0. Then y= Oand 23 ~ yj, +2 = 0. 
making the equality clear again. . 
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The next problem is quite challenging and requires some preliminaries. 
A very common situation in analytic number theory is the following one: we 
have two maps f and g and we assume that g(n) = >, ajn J (@) for all n. Mobius 
found a very nice inversion formula, which expresses f in terms of g. Define 
the Mobius function » by #1) = 1, p(n) = 0 if n is not squarefree and 
wlpipe-+- pe) = (—1)* if pi pe,..., pe are distinct wee numbers. Then the 
Mobius’ inversion ieaule reads f(n) = vy, # (4) 9(d). The main ingredient 
in the proof is the fact that $74, #(d) = 0 for all n > 1 and it equals 1 
for n = 1. This is immediate, since if pi, p2,...,p, are the different primes 
dividing n, then for n > 1 


So u@ =1- a mi) + 3° wlpipj) += 


dire i<j 


-()-Q)—= 
= (1-1) =0, 


by the binomial formula. Using this observation, we can write 


Sx) 9@ = Les (5) 
din dln 
= e@- YO Ka) 


din addin 
=a) YS e@ 

dyjn az 
= f(n). 


The next problem requires the multiplicative version of Mébius’s formula, 


namely if g(n) = Nan £(@); then f(r) = Tlan 9(4) yes ). ‘The proof being 
exactly the same as above, we leave it to the reader to fill in the details. 


3. Let (Gn)n>1 be a sequence of positive integers such that 


gcd(am, Gp) = Bged(mnn) 
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for all positive integers m,n. Prove that there exists a unique sequence 
of positive integers (6; )n>1 such that a, = ]] dg for all n. 
din 
Marcel Tena, Romanian TST 


Proof. In the light of the previous discussion, we are forced to define 
= wd) 
by = Il an 
din 


Of course, the hard point is to prove that 6, is an integer. In order to prove 
this, we will first transform the expression defining b,, using the Mersenne 
property? of the sequence (a )n. Namely, let n = p{"p$? ---p?*, then 


The key remark is that the Mersenne property yields the equality 
gcd( an Jier = a,,.2. 


for all subsets I of {1,2,...,&}. Therefore we can write 


by = Teed{oz.03) 
i 
Jax Hecd(az,a2,a2 


Finally, the inclusion-exclusion principle coupled with the local-global princi- 
ple easily yields the formula 


Tz: Tl gcd(a3, x5, 2%) 
eae : d esse lem(xy,29,...,2n) 
Thc; ge (zi, 23) Tl ee (wi, 25, Xk, x) 
for all nonzero integers #1, 22,...,2n. Using this observation, we deduce that 
Q 
bn = ee 
lem [ana Zz, an] 
PY PK 


See chapter 10, the introduction to problem 10. 
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which is clearly an integer (by the Mersemne property, if m divides n then @m 
divides a). The result follows. Oo 


Proof. Let p be a prime. Note that if a sequence (an)n>1 has the hypothesized 
property, then so does a, = p’r(m), Further, if the desired conclusion holds 
for all such al, then it clearly holds for am. ‘Thus we may assume a,, = p“" 
and we see that the sequence (un)r>1 consists of nonnegative integers an 
satisfies min(tn, tn) * Uged(m,n): 

Let 0 <r) <r <-+ be the values actually taken on by the sequence 
(tn)nz1- For any two elements m and mi of the set S, = {7 : Um = Te} we 
have Ugea(mm') = min( thm; tm!) = Te and hence ged(m,m’) € 5. Therefore Sk 
has a unique minimal element m, and all elements of S; are multiples of mp. 
Note that if j > &, then min(um,,tm,) = min(rj,7x) = Tk) SO gcd(mj, Mx) = 
mp OF Mylmn;. Thus in the sequence 1 = mj,™M2,m3,.-. each term divides 
the later terms. Now suppose mx|m and mai 1m. Then gcd(myz,m) = 
Mp SO MiIN(tmy,s Um) = Tk OT Um 2 Tk However ged(my41,7) < Mp41 $0 
min(tmngirs tm) < Tk and hence tm < Pet1- Since the r; are all the values 
taken on by (t#,), we conclude that tn = Tr. 

To recap, the sequences ry < rz < +++ and my, 72,.-- determine tn 
uniqiely. If k is the largest index for which m,|m, then up = rp. Define 
by = BT, by = PTH? for & = 2, and b,, = 1 if m is not one of the my. 
Then we immediately see that if k is the largest index for which m,,|m, then 


k k 
IL = Tom: =p" lew == pth = p\™ = dm. 
i=l i=2 


dim 


3.3 Legendre’s formula 


Legendre discovered a very beautiful and useful formula for v,(n!): 
n n— 8p(n) 
vp(nl) = >> [5] = 2 
sl p-l 
where s(n) is the sum of digits of m when written in base p. The proof of 


the first equality is easy, since there are [3] - [str| positive integers <n 
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such that vp(z) = 7. The second equality is a direct computation left to the 
reader. The purpose of this section is to present some nice applications of this 
formula. 

; We present two solutions for the next problem: one uses the local-global 
principle combined with Legendre’s formula, while the second one is more 
exotic, but very powerful. 


4. Show that if n is a positive integer and a and 6 are integers, then 
1 
wale + b)(a + 2b)--- (a+ (n~ L)b)b™™ eZ. 


IMO 1985 Shortlist 
Proof. We will prove that for any prime p < n we have 
Up(n!) < vp(a(a t+ b)--+(a-+ (n — 1)djo"“}), 


which is enough to solve the problem. If p divides 6, things are rather clear, as 
up(nl) < n—1 while v,(b"")) > n— 1. So assume that p does not divide 6. But 
then there are at least [n/p] multiples of p araong a,a+b,..,,a+(n—1)b (since 
among a,a + 6,...,a-+ (p— 1b there is at least one multiple of p, the same 
for a+ pb,...,a@+ (2p —1)b,...), at least [n/p] multiples of p? and so on. So 
the p-adic valuation of a(a +6) ---(@+(n—1)b) is at least [n/p] + {n/p?| +-- : 
and this is exactly the p-adic valuation of n!, by Legendre’s formula. 0 


Proof. Consider the matrix A = {a;j}icigsn where aj; = 
claim that we have > 


det(A) = -a(a + b)--- (a+ (nm — 1)b) 


ni 


To prove this, note that one can evaluate any determinant of the form 


Pi(ai) Pi(az) + Pian) | 
Po{x1) P2(rq) tee Px{@n) | 


Pali) Paltta) + Pa(arn) 
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where P; are polynomials of degree at most n — 1, by multiplying the matrix 
of the coefficients of the P;’s by the Vandermonde matrix aes and then 
taking the determinant. Using this observation, it is easy to prove the above 
formula. 

Now, since det(A) is an integer (as it is a polynomial expression with 
integer coefficients in the entries of A), it follows that for any p relatively 
prime to , the p-adic valuation of a(a + b)---(a@ + (n — 1)b) is at least that 
of n!. As for primes dividing b the problem is easy (since up(n!) < n — 1), the 
result follows. Q 


5. Prove that for all integers a,b with b # 0 there exists a positive integer 

n such that ve(n!) = a (mod 8). 
K6MaL 
Proof. If so{n) is the sum of digits of n when written in base 2, we need to 


find n such that n — s9(n) = a (mod 6). Choose n = 27! +27? +---+27* with 
ay < 22 < +++ < ey. Then so(n) = k and we need to ensure that 


ger _.{4-.-+27%—1=a (mod 6). 


Write 6 = 2c with odd ¢ and choose r < x) < @ <-++ < a, such that 
a; =1 (mod ¢(e)). Then 2% - 1 = 1 (mod c) and so 
281 Lp. + 2% —L=k (mod c). 
Also, we have 
Qe + 2 — 1 —k (mod 2”). 


Thus, it is enough to choose k such that k =a (mod c) and k = ~a (mod 2), 
which is possible by the Chinese Remainder Theorem. Oo 


Remark 3.2. This problem admits a vast generalization, that appeared on the 
IMO Shortlist 2007. Namely, we can prove that for any positive integer d and 
positive integers 6), 2,--.,0m there exist infinitely many positive integers n 
such that dln ~ s5,(n) for all i. Here s,(m) is the sum of digits of n when 
written in base b. 
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Proof. Let cp{n) be the number of digits of n in base & and consider the 
sequence defined by 


It-maxi<j<m ¢%, (a3) 


ag = dbyb2-++bm, aj41 = 
It is trivial to check that d divides a; for alt i and that 


Sp, (dig + Gag ++ + iy) = 80,(4i,) + 80, (4:2) +++ + $b, (ai) 


for any distinct numbers 1, i2,..., 4;. 
Since the m-tuples 


Si = (8p, (ai) (mod d), s»,(a;) (mod d),...,8,,,(a;) (mod d)) 


take only finitely many values, the pigeonhole principle yields the existence 
of an m-tuple S that repeats infinitely often, say S = S;, = S;, = +++ for an 
infinite increasing sequence ig, i;,.... 

Let 


Ch = Bigg t Fiangs Foo Cian ant- 


We will prove that for all k we have d\c, ~ S8b;(Ck) for all j, which will end 
the proof. Since clearly djcg, it remains to check that d divides sy, {c,), which 
follows from ‘ 


3b; (cx) = 5b; (Gia4) + 8b; (Gigees Jae te 8b; (Qigpa—r) 


and 


8b; (Giae) = $b; (Qiang) = + = 8b; (Giany a1) (mod d). 


A trickier combination of the local-global principle and Legendre’s formula 
can be found in the following problem, which implies the classical inequality 
lem(1,2,...,2) 2 2"-!. This nice observation as well as the problem ap- 
peared in [31]. Amusingly, the same result appeared much before [31] in the 
same journal, in the form of a proposed problem! There is also an elemen- 
tary, but more difficult proof of the fact that lem(1,2,...,n) < 3” for all n, 
see for instance [38]. Using the prime number theorem, one can prove that 
jem(1, 2,...,7) behaves like e”. 
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6. Prove the identity 


(n+ 1) Ilom (@). Ges Gy = lem(1,2,...,.n +1) 


for any positive integer n. 
Peter L. Montgomery, AMM E 2686 


Proof. We will prove that for any prime p the expressions in the two sides 
have the same p-adic valuation. Note that 


ven(() nC) 


Let & denote the number of times p divides the right-hand side of the equality 
from the problem statement, so p* <n +1 < p**!. Taking i = p* ~ 1, we 


obtain that - 
Up (m+n(7)) > uplitl) =k, 


which shows that the p-adic valuation of the left-hand side is at least k. The 
opposite inequality is more delicate. Fix 0 < i <n and note that Legendre’s 
formula gives 


(C2)-E> = PA-EALE 


r>i 


(3.) 


Now, since 
[a + 6] ~ a] — [6] = [{a} + {}] € {0,1} 


for any real numbers a,b, we have x, € {0,1} for all r. Moreover, we have 
t, =O if r > k, since in this case p" > m+ 1. The crucial point is that for all 
r < Up(i+1) we also have xz, = 0. Indeed, writing i+ 1 = pu for some integer 


u, we have 
n+l n+l =0 
y= rr ~ur ~ul =0. 
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Putting these remarks together yields the inequality 


IE Sk-w,(i+ 1), 


rel 


which, combined with (3.1), yields the estimate 


ty (@+o(77t)) <k, 


establishing the opposite inequality. i) 


We continue with a rather tricky diophantine equation. 


7. Solve in positive integers ¢20°7 — 2007 — x1 — yf. 
Romania TST 2007 


Proof. We claim that there are only trivial solutions 2 = y. Suppose that 
x > y ts a solution of the problem. We will distinguish two cases. 

In the first case, assume that y < 2007. If y = 1, then 7207 = x! and 
trivially x = 1 (otherwise x — 1 would divide 27, so 2 —1 = 1, which is 
clearly not possible). Thus y > 1 and we may choose a prime ply. Then p 
divides y?°7, z!, y!, so that plz. But then 


2007 < vp (27% — y?7) = vp(y'(a/y! — 1) = uly!) < y, 


a contradiction. 

Now, assume that y > 2007. Then «—y is a multiple of any prime p smaller 
than 2007 such that (2007,p — 1) = L. Indeed, if p is such a prime, then p 
divides 12007 — 2007 — 1 yl, Tf p divides z, then clearly it also divides y and 
so it divides x ~y. If not, since p divides «?-! ~y?~! and ged(2007, p— 1)=1, 
it follows that p divides « — y. We deduce that « > y+ 2007 in this case. But 
then 


zl — yl = yl(a!/y! ~ 1) > 2007!- a(@ — 1) --- (@ — 2006) > 72%, 


again a contradiction. 
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The next problem is also tricky. 


8. Prove that for all positive integers n different from 3 and 5, n! is divisible 
by the number of its positive divisors. 


Paul Erdds, Miklos Schweitzer Competition 


Proof. Since the number of positive divisors of n! is precisely Thp<al i+vp(n!)), 
it suffices to prove the following 


Lemma 3.3. Let P, be the set of prime numbers less than or equal to n. For 
alin # 3,5,7 there exists an injective map f : Py > {1,2,...,n} such that 
1+ u(r!) divides f(p) for all p € Pa. 


Indeed, if this is true, then [],,<,,(1 + up(n!)) is a divisor of T],cp, f(p), 
which divides n!, because f is injective. Thus the problem is solved for n 4 
3,5,7. For n = 7, we can check by hand that the result holds. 

It remains to construct f. For p < \/n simply choose f(p) = 1+ vp(n!). 
Note that f(p) < n follows from Legendre’s formula and for p < q < Jn 


we have [3] 2 [2] for all 7, the inequality being strict for 7 = 1 (because 


3 2 py > 1). Por the other primes, we will define f by induction. Assume 


we defined f for all primes ¢ < p, where p € P,,p > s/n is given. There are 


1+ [2] 
multiples of 1+ v,(n!) (we used the fact that p > /n to obtain vp(n!) = [2| )e 


If we manage to prove that this number is greater than 7(p) ~ 1, which is the 
number of occupied values for f(p), we are done, since we can take for f(p) 
any of the remaining values. However, note that m(p) < BH with equality 


n 
l+Up 


only for p = 3,5,7. Let us evaluate |=tal: Write n = kp +r for some 
Pp 


O<r <p, so that 


n kpir rp r—p p 
= A ay A ae ean ee 
1+ [3] esa p+ [Tot] >e Lee? ae 


nm 
Pp 
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So, for k > 2 we have 


Feseea >a(p)~1 


and we are done. For & = 1 it remains to see if we can ensure that [3] > ae 
But this trivially holds for n > p. So, the only obstruction is n = p and 
a(p) = a. As we observed, this only happens for n = 3,5,7, which is 
excluded by the hypothesis. 


For the next problem, we will need some basic estimates about prime 
numbers, We refer the reader to the addendum 3.A for proofs and more 
details. 


9. Let n,k be positive integers such that n > 9*. Prove that (7) has at 
least k distinct prime factors. 


Paul Erdés, Miklos Schweitzer Competition 


Proof. We will prove that for n large enough, 9) is a multiple of a product 
of k numbers that are pairwise relatively prime and greater than 1. This will 
clearly imply that it has at least k prime factors. Define L; = lem(1,2,... ,k). 
The key point of the proof is the following 


Lemma 3.4. Forn > k + Le, (j) is a multiple of [3 gar. 


if this happens, we are done, because the numbers 
are pairwise relatively prime and greater than 1. 

To prove the lemma, we will compare p-adic valuations for all primes p. 
Note that the lemma is equivalent to 


noi nog 
ged(n~t,Lg)? ged(n-7,Ly) 


n 


ki ll ged(i, Lx). 
tmn—k+i 
Thus, we need to prove that for all primes p we have 


n 


up(k!) SSF up(gcd(é, Lx))- (3.2) 


ten-kth 
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Let r = vp(Lx) = flog, k]. For all i < 7 we can find at least [4] multiples of 


pe among the k consecutive numbers n ~ k + 1,n—k+2,...,n. ‘Also, ifuisa 
multiple of p' with i < r, then so is ged(L;,, u). We conclude that (3.2) follows 
from Legendre’s formula, as the only nonzero terms on the left-hand side are 


of the form [4 with i<r. 
Finally, it remains to prove that k + Ly < 9*, Note that 


pe T[ oe = Il p II plot ®l < 4k. II k 


pSk povk  psvk 


where here we used theorem 3.4.3. Thus 
Ly < kVE ak <9, 


the last inequality being easy to prove. Oo 


3.4 Problems with combinatorial and 
valuation-theoretic aspects 
The problems in this section are fairly elementary, but none of them is 
easy. 


> 2 and let a7,@9,...,a% be positive integers, not all of them 
equal. Prove that there are infinitely many prime numbers p for which 
there exists a positive integer k such that 


10. Let n > 


plak + ak +--+ ak. 
Tranian Olympiad 2004 


Proof. By dividing all a;’s by their greatest common divisor, we may assume 
that they are relatively prime. Suppose that there are only finitely many 
primes p1,p2,...,py such that all prime factors of af + ak pet ak (where 
k varies over the positive integers) are among p1,p2,.--,PN- 
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Assume that among @1,@2,...,@, there are 6; numbers not divisible by 
pi. Since a4,42,...,@, are relatively prime, we have 6; > 1. Consider 


k= aTLo( aa 


Since k > up,(bj) + 1, we have p; vee O41 


la’ whenever pjjaj;. Using this and 
Euler’s theorem, we infer that 


ak + ak yo. + ak =b; (mod P; pee ba)tTy 
Hence 
up, (a® take yg aby = Up, (bi) 
for all 7. Since all prime factors of af + af +-+-+ ak are among py, p2,..-, PN; 


we deduce that 


Per a), pivalea) spew Ow) 


at + ay ++ ak = py 


Now, at least one of the a,’s is greater than 1, thus 


N 
a paht- tak >>> Tp pre. 
4 Di 
tml 


The two relations are clearly contradictory and the problem is solved. Q 


A, classical problem of Erdés is the following: if a;,,a2,. +:@n41 are dif 
ferent positive integers not exceeding 2n, then one can find i Z j such that a; 
divides aj, The idea is very simple and beautiful: the largest odd divisors of 
the a;’s form a sequence of n+ 1 odd numbers between 1 and 2n — 1, so there 
must be two equal terms in this sequence. But then the corresponding a; and 
a; have a quotient which is a power of 2. The next problem is a variation on 
this classical gem. 
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11. Let f(n) be the maximum size of a subset of {1,2,...,} which does 
not contain two distinct elements i,j such that 7/27. Prove that there 
exists a constant C > 0 such that for all x we have 


[rem ~- = < Clon. 


Paul Erdés, AMM E 3403 


Proof. We will actually exhibit the optimal set with the property that it does 
not contain two distinct elements 7,7 with i|27. Defining 


An={aeZl n/&S<a<n, ve{a)=0 (mod 2)}, 


it is easy to see that A, satisfies the conditions of the problem. Indeed, if 
i,j € Ay and i|2j, we must have é{j, since we cannot have v2(i) = v(j) +1 as 
both v2{i) and v2(7) are even. So j/i is either 1 or 2. It cannot be 2, because 
in this case one of v(t), ve(j) would be odd, so it has to be 1 andi = j. 

Next, we prove that this set is optimal, in the sense that it has the maximal 
number of elements among all sets satisfying the conditions of the problem. 
Take any such set A and fix k such that 3 does not divide k and v(k) is even. 
Look at all the numbers &,3k,9k,... and 2k,6k,18k,.... There is at most 
one element of A among the union of these numbers and by definition there 
is exactly one element of A, among them. On the other hand, if one varies 
k, the numbers k,3k,...,2k,6k,... form a partition of the positive integers. 
This clearly implies that A has at most as many elements as Ap. 

Finally, we have to estimate the size of An. The elements of A, are exactly 
the numbers of the form 4/b with 6 odd, j > 0 such that 35; <6 < 7. There 
are approximately 3"j; odd numbers 6 such that ga <6 < ZG, with an error 
at most 2 if 47 < n and error at most 34 for 4? > n. The total error is thus 
logarithmic in n and since 


we have 4s 
|An| = J + O(log), 


finishing the proof. Oo 
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12. Find all positive integers n with the following property: there exist 
natural numbers bj, 62,...,0,, not all equal and such that the num- 
ber (6) + k)(bz +k) +++ (bn +k) is a power of an integer for each natural 
number k. Here, a power means a number of the form <¥ with zy>d 


Russia 2008 


other 6;’s equal to 1. Then for any k we have 
(b1 + &)(b2 +) ++ (by KY = (+ 2)%(b + 1D, 


which is certainly a power. 

So, the real question is to decide whether numbers 6;,b2,...,, can exist 
when n is a prime. It turns out that the answer is negative. Assume that such 
numbers existed and let c1,c2,...,¢y be the set of distinct numbers among 
61, b2,...,6n, with multiplicities m1,mo2,...,mn.- By assumption, we have 
N > 1 and clearly n = mi + mg +--+ +m. Moreover, we know that, for 
any k, the number (ce, + k)™ (co + &)"2--- (cy + k)™* is a perfect power. 
The key point is to choose numbers & for which we can find distinct primes 
Beri ++++pw such that vp,(cj +k) = 1 if i = j and 0 otherwise. In this case, 
i 

(ey +B) (eg 4 RY (ey + RYN = ot 


for some ,y > 1, we have YUp,(z) = mj, so that y divides all m;. But then 
y divides their sum, which is n and since n is a prime, it follows that n = y. 
Thus n = y will divide all m; and this obviously contradicts the fact that 
N> Land mi +mat+---+my =n. 

Thus, we are done if we can find distinct primes p1,p2,...,py and k such 
that vp,(cj + k) = 1 if i = 7 and 0 otherwise. This is very simple: first, 
we choose some distinct prime numbers 1, po,...,pn sufficiently large, say 
not dividing any of the numbers ¢; — ¢; with i # j, and then choose k such 
that k + cj = p; (mod p?) for all i. Such k exists by the Chinese Remainder 
Theorem. Of course, vp,(k + ¢;) = 1 and for j # i we cannot have pile; + k, 
since otherwise p; would divide c; — ¢;, contradicting the choice of p-Thus, 
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such k satisfies all desired conditions and the answer to the problem is that a 
i J 
must be composite. c 

A nice mixture of valuation-theoretic arguments and pigeonhole principle 
can be found in the following problem. 


13. Let a be a positive integer. Prove that the set of prime divisors of 2?" +a 
for n = 1,2,... is infinite. 
Iranian TST 2009 


Proof. Assuming the contrary, let p1,p2,...,pn be such that all prime factors 
of 22" +@ are among p1,p2,...,pn for all n. Pick a large number r such that 
ar > a?" + and ng such that 27"° +a > (pipo---pw)’. Then for all n > no 
we have 


N in 
gn vp, (2* +a) 
(pipa--pwy <2" +a = [[p;” E 
i=] 


so that we can find 1 <i < N with v,,(2"" +a) > r. This p; depends 
on the choice of n, but among the indices i associated to the numbers n= 
no+i,ng+2,...,n9 +N +1 there will two identical ones. ‘Thus we ee ae 
peje" +a and pier +a for some n > ng, some 1 <ms N + and some 
1<i<.N. But then 2°" = —e (mod pf), so that 2 a” (mod pt) 
andp? ja?” + a. In particular, 


qu i har a -ar pf >a, 
contradicting the choice of r. The result follows. im 
We continue with two more unusual problems. 


14. Let pi,pa,---, pe be distinct prime numbers and let S be the set of 
positive integers all of whose prime factors are among pi, P2,---s Pre If 
A is a finite set of integers, let G(A) be the graph whose set of vertices 
is A, two vertices a,b € A being connected if a — 6 € S. Is it true that 
for all m > 3 we can find A with m elements such that 
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a) the graph GA) is complete? 
b) the graph G(A) is connected with all vertices of degree at most 2? 


Miklos Schweitzer, Competition 2009 


Proof. The answer to the first question is negative. Let p be the smallest prime 
different from pi,p2,..., px and suppose that G(A) is complete for some finite 
set A with |A| > p. Then there exist a,b € A different such that p divides 
a— b, so that @ and b arenot connected. 

On the other hand, the answer to the second question is positive! We will 
construct a set A with m elements a), a2,...,dm such that G{A) is a simple 


path. It is enough to ensure that for all 1 Sn SM, Qazi — Gn € S and 
On+2 ~ An; An4+3—An,... are not in S. But we can choose 


On = Pip2 ++ Pe + (Pip2 +++ Dy)? +++ + (prpa+--pR)”. 
Then clearly Gni, —~ a, € S. On the other hand, for any i > 2 we have 


Up, (Angi — Gn) =2-+1 for all 1 <j Sk. Since dna; — an > (pipa r++ pe)? *t, it 
follows that an4;— a, is not in S and the result follows. 


15. Let m and n be positive integers such that m + lm+2,--- men 
are composite numbers and m > n"™~!. Prove that we can find pairwise 


distinct prime numbers pi, p2,...,py, such that pi divides m + é for all 
lsi<n. 


Tuymaada Olympiad 2004 


Proof. First, we will deal with those é such that m-+i has at most n—1 prime 
factors. For such i choose a prime p; for which py (m+) is maximal (note that 
p; 38 unique with this property), Since m +7 > n®-} and since m + 7 has at 
most n ~ 1 prime factors, we have pir ets) >. We claim that if i 47 and 
m+i,m-+ 7 have at most n-- 1 prime factors, then p; 4 p;- Indeed, assume 
that pj = pj = p. Then min(pte(™+? , peer+i)) divides m +i and m+ and 
moreover is greater than n. But any common divisor of m +i and m +9 
divides j — i and so it is smaller than n. ‘This shows that 73 pi is injective, 
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It is now easy to conclude: make a list with those numbers m +7 having 
at most n—1 prime factors. For each of them, the previous paragraph yields a 
prime factor p; and the associated p,’s are distinct. If all m +7 are in the list, 
we are done, otherwise successively pick remaining numbers and choose one 
of their prime factors which is not among the p;’s or among primes previously 
selected. This is possible at every step, since any m +? not in the list has at 
least n prime factors. fo 


3.5 Lifting exponent lemma 


This section is devoted to some applications of the following useful result: 


Theorem 3.5. (Lifting exponent lemma) Let p be an odd prime and let a 
and b be integers relatively prime to p, such that pla ~b. Then for all positive 


integers 7 
up(a™ ~ b") = up(n) + Up(a ~ b). 


Proof. Consider first the case vp(n) = 0. We need to prove that p does not 
divide “=*", which is clear, as by hypothesis 


a®™ — b* 
a—b 
and p does not divide na"~!, Next, we prove the result for n = p; we need 
to check that p divides exactly once into a?-! +... 4+ 0-1. Write b = a + pk 
for some integer k. Then by the binomial formula we have ' = at + ia’! pk 
(mod 7), so that 


aati 4a py-.-4b") =na™! (mod p) 


ppp pl _ pel 
bl ; = Sat = Ler + ipka??) 
oF i=0 ix0 
-1 
spa? + pk or = pa’) (mod p*). 


Let us apply the case n = p to a? and b"/P (note that they still satisfy the 
hypotheses of the problem) to get vp(a” —b") = 1+ vp(ar/? — b"/P)_ The result 
follows now by an immediate induction on vp(7). im] 
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The reader might wonder what happens for p = 2. There is of course a 
version of the theorem for p = 2, but the formula is more complicated. The 
proof is however much easier. 


Theorem 3.6, Let x,y be odd integers and let n be an even positive integer. 
Then 
2 


woo" — 9") = oy (= =H) + tw) 


Proof. Write n = 2ka for some odd number a. Then 
uy" = (@* — y\(a* + y?)(a 4 y*) oan ("a Pe ya), 
Now observe that if u,v are odd numbers, tien u? +? = 2 (mod 4), Thus 
vox" — y") = ue(a** — yy) +k - 1, 


Finally, since a, z,y are odd, it is easy to see that eve is odd. The result 
follows. : 0 


An easy consequence of these results is the following estimate. It is much 
weaker than the previous theorems and can be proved directly by induction, 
too. 


Corollary 3.7. [f a,b are integers and p is any prime dividing a~ 6, then for 
alln we have 
Up(a® — B”) > vp(a — b) + vp(n). 
The next problem is an immediate consequence of this corollary. 
16. Let a,b,¢ be positive integers such that cla° ~ b°. Prove that =e. 


I, Niven, AMM E 564 


Proof. First of all, note that ae is an integer, so the statement makes sense. 
Let Be a prime dividing c. We will prove that up(e) < up (=¥). If p does 
not divide a ~ b, this follows from the hypothesis that c divides a° ~ b°, so we 
may assume that p divides a ~ 6. But then everything follows from corollary 
3.7. 123) 
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The next problem appeared as a Chinese TST 2009 problem, but the 
result is much older (see [61]). 


17. Let n be a positive integer and let a > b > 1 be integers such that 6 is 
odd and 8" |a" —1. Prove that a’ > e 
M.B. Nathanson 


Proof. Take any prime factor p of b. Since b is odd, we have p > 2. Note that 


We deduce that 


rt te 
a> Pat > pve) > PL > Se 


ma tom x 


rh n 


and the result follows. aa 


Remark 3.8. Using a generalization of the deep Thue-Siegel theorem, Mahler 
proved in [48] the following result: if @,b,u,v are nonzero integers with u > 
v > 1, then there are only finitely many positive integers n such that 


au” =b (mod v"). 


It is quite rare to see two very similar problems at the IMO; nevertheless 
the following problem appeared in weaker forms at the IMO in 1990 and 1999. 


18. Find all primes p and all positive integers n such that n?—! divides 
(p—1" +1. 
After IMO 1990 and 1999 


n= 2. From now on, we assume that p > 2. If n is even, then 4 divides n?—!, 
but it does not divide (p — 1)" +1, a contradiction. So, m is odd. Let q be the 
smallest prime factor of n. Since 


(p- 1)" = 


“land (p—1)%!=1~ (mod q), 
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the facts that n is odd and ged(n,qg~ 1) = 1 imply that p— 1 = —1 (mod 4), 
or p = g. 
By the lifting exponent lemma (using that n is odd) we have 
(Pp ~ Lup) = up(nP) < vp((p~ Y" +1) = 1+ a(n). 


Thus (p ~ 2)up(n) < 1. In particular, p = 3 and v,(n) = 1. Write n = 3a 
with ged(a,3) = 1 and observe that a? divides 8¢ +1. We claim that @ = 1. 
Otherwise, let r be the smallest prime factor of a, so that 


8? = -1and8""!=1 (modr), 


whence 8 = —1 (mod r) as ged(a,r ~ 1) = 1. But then r = 3, which is 
impossible. It follows that a = 1 and n= 3. 


Remark 3.9. Another interesting problem that can be solved using the same 
ideas is the following one: find all positive integers a and 4 such that a? divides 
i 1, 


19. Find all positive integers a,b,c such that 
(2-138 -1) = ed. 
Gabriel Dospinescu, Mathematical Reflections 


Proof. We will only give the key ideas, since the computations are a bit tedious. 
Take a solution (a,6,c) with ¢ > 3 and note that a is even, since 3 divides 
2*—1, Also, as 4 divides cl, 4 must divide 3°~ 1 and so 6 is even. Then, using 
the lifting exponent lemma, we deduce that 


e— $3(c) 
2 


Similarly, by writing 6 = 2*r with r odd, we have 
BI 
v2(3° — 1) = v Ge - ») + v2(b) 


= 02(4(9"! + 9°? +--+ 1) + v2(b) 
=2+ v2(b). 


= vg(el) = vg(2? — 1) = 03(44/? —1) = 1 + ua(a). 
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Thus 


¢— 89(c) = ve(cl) = v2(3° — 1) = 2 + v2(b). 


We deduce that 3§-2 
a > 3% > gir trloss(—1 BE 


c 
and 


ge-3 


b> gue(b) > gor tnloggem2 


From here it follows that 


ade (re 


It is not difficult (even if it is quite tedious) to check that this does not hold 
for any ¢ > 12. We deduce that in any solution we have c < 11. We can easily 
exclude the cases c € {8,9,10,11}, since in this case we obtain ¥2(b) aay 
forcing b > 32, which is too large. For c € {4,5,6,7}, we have cl > 3 -1 for 
only one value of } with v2(b) = c~s2(c)~2, so we simply check to see if there is 
a compatible value of a, Combining these with the obvious solutions for ¢ < 3, 
we end up with (a,b,c) € {(1, 1,2), (2, 1,3), (2, 2, 4), (4, 2, 5), (6,4, 7)}. o 


b 


20. Let @ be a fixed positive integer. Prove that the equation ni = a’ ~ a° 
has only finitely many solutions (n, b,c) in positive integers. 


Chinese TST 2004 
Proof. Let p be an odd prime not dividing a. Then by the lifting exponent 


lemma we have 
vp(a™ — 1) S$ up(a?? ~ 1) + vp(n). 


Taking n = 6 ~c and noting that up(n!) > 3 — 1, we conclude that 
5 n 
vp(b — €) > up(n!) ~ vp(aP-! — 1) > a K 


for some constant K, independent of n. Letting « = p~* > 0, we conclude 
that b> ep/? for all n. Thus 


n®>nl=a—ao >a > aP”. 


i 
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Taking logarithms, we deduce that n is bounded in terms of a. 
Since c,b—¢ < nl, the conclusion follows. a 


We continue with a very beautiful, but hard problem. 


21. Let x,y be relatively prime integers greater than 1. Prove that for in- 
finitely many primes p, the exponent of p in 2?) — yP~1 ig odd. 


Barry Powell, AMM E 2948 


Proof. Choose any integer k > 2. It is a well-known result of Fermat that 
the equations at + 64 = c? and a4 + b4 = 2c? have only trivial solutions, 
Therefore a?" 4. yn is neither a perfect square nor twice a perfect square, 
whence there is some odd prime p such that up(22*°" + 2") is odd. Since 
ged(x,y) = 1, p cannot divide xy. Since p divides x2" — y?* and does not 
divide 22°”? ~12""' | the order of z/y mod p is 2* and 2* divides p—1. Hence 
the lifting exponent lemma. gives 


-1 


vp(a?~? — yP-t) = up(a* -y")= vo +P "\s (mod 2). 


The result follows by taking successively k = 3,4,... and observing that the 
prime p associated to & in the previous discussion satisfies p>1+2*, 


Proof. We will argue by contradiction: assume that there exists N > |x -y| 
such that for all primes p > N’ we have 


up(a?? — yt) = 0 God 2). 


We claim that for all p > N the number =< is a perfect square. Take a 


prime factor q of £=#_ Since q does not divide z (otherwise q divides y, 
contradicting the fact that ged(x,y) = 1) and since g divides 2? — y?, the 
order of x/y mod q is 1 or p. If q divides x — y, then q divides px? and so 
q = p, contradicting the fact that p > N > |x ~ y|. So the order of a/y mod 
q is p and so p divides ¢ ~ 1. Then the lifting exponent lemma iinplies that 
tga} — yl) = ug(a? ~ yP) is an even number, finishing the proof of the 
claim. 
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Next, take a prime r = 3 (mod 4) dividing 4ry — 1 and (using Dirichlet’s 
theorem) take a prime p = —1 (mod r ~ 1) larger than N. Thus there exists 
z such that 


1 
a 4 (modr), 


i.e. —1 is a quadratic residue mod r, a contradiction. 


3.6 p-adic techniques 


For more details on the results and techniques used in this section, the 
reader is (strongly) advised to read the addendum of this chapter, which is a 
modest introduction to p-adic numbers, Before passing to the next problem, 
let us recall the notion of congruence for rational numbers. Let p be a prime. 
The localization of Z with respect to the ideal pZ is the set 


Zip =F € Qlged(a,b) = 1, god(b,p) = 1}. 


It is easy to check that Z;,) is a subring of Q. If a,b € Ziq), we write a = b 
(mod p’) if a--b € p!- Z). This is equivalent to the fact that the numerator 
of the fraction a ~ 6, when written in lowest terms, is a multiple of p’. It is 
easy to sce that this is an equivalence relation, satisfying the usual properties 
of congruences over Z. Moreover, we | have a natural morphism f of rings 
Zq) > 2/pt, sending § to a- Le * being the inverse of b mod p). Then 
for z € Zq) we have « = 0 (med p) if and only if f(x) = 0. This observation 
is very useful when trying to prove congruences for rational numbers. For 
instance, let p be a prime greater than 3 and consider the element 


pee 
k=1 Be 
of Zgy). We have 
ro pod 
f(z) = YO = hI = 0, 
ket a | 
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since the map z -> z~! is a bijection of (Z/pZ)* and 


3 pee eS re) 
kml : 


is a multiple of p. Thus the numerator of the fraction Sa =i > When written 
in lowest terms, is a multiple of p. The next problem is a variation on this 
idea combined with some ideas from p-adic analysis. 
y 22. Let p > 5 bea prime. Prove that p* divides the numerator of the fraction 
pi pal 
1 1 
2 te B 
k=1 k=1 
when written in lowest terms. 
Gabriel Dospinescu 


Proof. The first step is to note that 
Ses Tag 
=e “(Et Pp “*) ip my 
Thus, it is enough to prove that 
pol 1 , 
> Ge Ht z) =0 (mod p’) 


Now, the crucial remark is that in the field of p-adic numbers we have the 
convergent expansion 


= 1 oe ae eee PP 
kp—k) ~~ R1-F a(i+t+B+-). 


By truncating at level p? we obtain the congruence 


I Pp 9 
wy B =-g- qa (mod p). (3.3) 


118 Chapter 8. Look at the Exponent 


Of course, one does not need p-adic numbers to find or check the previous 
congruence, since obtaining the appropriate polynomials in p and & and vali- 
dating that they work are each formal algebraic matters. The p-adic approach 
simply gives a straightforward route to both, here. 

Using (3.3), it remains to prove that 


poi 1 poi 1 
Ss B +p>, p= (mod p). 
kel kel 


We will actually prove that 


pri pol 


ie =0 (mod p’), te 0 {mod p). 


k=1 kel 


The same argument as in the preliminary discussion yields 
=S°k4=0 (mod p), 


the last congruence being established either by using the existence of primitive 
roots mod p (which makes the corresponding sum the sum of a geometric 
progression with ratio g4, where g is a primitive root mod p) or simply by 
using explicit formulae for this kind of sum. In order to prove the other 
congruence, note that 


Ly, Boe B+(p-8? 8 
seen det yt 


(mod p*), 


80 


The result follows. Q 


The following problem is a generalization of a classical congruence due to 
Fleck. For the reader not familiar with p-adic numbers, it is worth reading 
the addendum 3.B before attacking the proof. 
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Y 23. Let p be a prime and n,s positive integers with n > s+ 1. Prove that 


pt divides 
ae (t 
> 1) eG), 


O<ksn 
alk 


where d = [5274]. 


Gabriel Dospinescu, Mathematical Reflections 


Proof. Fix a primitive root of unity z € C of order p. The field Ki = Q(z) is 
an extension of degree p ~ 1 of Q, as the minimal polynomial of z is 


Xela XP EX +1. 


Choosing a prime (ideal) 8 above p in the ring of algebraic integers Ox of 
K and completing §-adically, we obtain a valuation vp, on K (seen inside its 
8-adic completion), which extends the usual p-adic valuation on Q and such 
that vp(1 ~ z) = aie ‘To prove the last equality, note that ie is a unit in 


Ox for all i relatively prime to p, so vp(1—z) = tp(1— 2"). Since we also have 


pol 7 
[[a-#) =p. 
i=1 
the result follows. 
Note that 
arse 
= Fe 
’ Ss Zz 0 
ged 


if k is not a multiple of p and 1 otherwise. We deduce that 


St (antes (;) os US 5-2) (;): 


F=0 k=0 
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Now, let n - s~1=d(p—1) +r for some 0 < r < p~ 1. We will prove that 


olor) 


for all 0 <j <p— 1. This will imply that 


Up X -yt-e (7) >d=1 


OSk<n 
pik 


and since this p-adic valuation is an integer, the result will follow, 
Now, to prove the claim, we will use the following: 


Lemma 3.10. The polynomial ST} _o k* (R)x* is a multiple of (1+ X)""* for 
alls <n. 


Proof. This is very easy: for ¢ = 0 it is clear, and if 
nm 
we (x = (1+ X)"°f(X) 
k=0 


then we get the inductive step by differentiating and multiplying both sides 
by X. 


Coming back to the proof, write 


rm 
ye ()x* == (1+ X)™°f(X) 
k=O 
for some f € Z{X] (note that we necessarily have f € Z[X], as (1+ X)"* 
and Sig b°({)X* have integer coefficients and (1 + X)"~* is monic). Then 
for? 1 <j <p we have 
n 


ti (2) =a oe 


k=0 


so we only have to deal with j > 1. 
dey $= AB Séx=y? 
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and so 


Thus, the claim is proved and the result follows. i) 


We end this chapter with a wonderful result due to Skolem, Mahler and 
Lech. It truly shows what a versatile tool p-adic analysis can be. Even though 
the result still holds for p = 2, we will assume that p is odd in order to avoid 
some technical problems. 


@24. a} Let p be an odd prime and let ao, a),... be integers such that 


nin 
K 

wr ({)ae=0 

k=O 


for infinitely many positive integers n. Prove that for all n we have 
that an = 0. 


b) A sequence (ay), of integers satisfies 
Oned = L1idy+d—-1 + T2Gnzd—2 +++ + Zddn 


for all n > 0, where d > 1 and x1, 29,... ,<g are integers. Prove that 
there exists a finite set S and integers ¢1,¢2,...,¢n,d1,do,...,dN 
such that 


{n > Olan = 0} = SU (e, + diN)U---U (ew + dy). 


Skolem-Mahler-Lech theorem 
Proof. a) Consider the following function 
_ Wr £ 
ie) = rt (7). 
EDO 


Of course, such a series cannot converge as a series of real numbers, but it dees 
converge as a series of p-adic numbers, since vy (p ay (f)) = & for all x € Zy 
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(note that (7) € Zp because the map x -» (*) is continuous and takes values in 
Zp when x is in the dense subset Z of Zp). Thus f defines a map f: Lp a Zp. 
The crucial property of f is that it interpolates the sequence S7;_o?’ ( Lae 
and that it converges to its Taylor series (has good analytic behavior). The 
first claim is obvious, since by definition we have 


iia 
= a 
40) = (7) ae 
k=O 
The second claim is more delicate. Let us write 
© 1 x ; 
(i) kt de bj 40? 
j=0 
for some integers bj;,. Then we can write 
k 


f(x)= eo So by xe? = So a?, 


keo G0 20 


ss j. 1 TTS) a Fs) > since 
where c; = Dee Ponta Note that the series defining c; converges, since 


tends to 9c as k oo. The same estimate shows that 


, p-—2 p-2 
Mp lez) = c+ = 3 
vple;) 2 int aA 


(this is why we assumed that p > 2). The conclusion is that f converges to its 
Taylor series and so behaves as a holomorphic function, i.e. has good analytic 
properties. ; 

Now, by hypothesis we know that f(m) = 0 for infinitely many integers 
n. We want to prove that f = 0. This will imply that f(n) = 0 for all x and 
it is easy to see that this forces a, = 0 for all n, Since Z, is compact, we 
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deduce that there exists a € Z, and an infinite sequence of integers nj such 
that f(nj) = 6 and nj converges p-adically to a. Now, for all x € Zp we can 
write 


Again, the series defining dy = )>,., cj (j)a/~* converges because up(ej) “3 00 

and we also have 2 
‘ p- 

Uplde) > inf vp(¢;) > eT 


Recall that f(n;) = 0 for all 7. On the other hand 


Up SS da (ny ~a)F} > Up(nj ~ a) ~ 00, 
RE1 


so that limj.. f(nj) ~ do = 0. We deduce that dy = 0. Dividing the equality 

J (nj) = 0 by a— nj and repeating the argument, yields dj = 0, then dz = 0 

and so on. We deduce that all d;’s are zero and so f = 0. The result follows. 
b) We may assume that xq 4 0. Consider the matrix M defined by 


Mag = Ljmiti 


for i <n and whose last row is Td, €d—1,-..,21. This is the companion ma- 
trix associated to the characteristic polynomial X¢ ~ aX"! — ...— avy of 
the recursive relation. Let V, be the column vector whose coordinates are 
Gn, An41,+--;4n4¢-1- Then the recursive relation becomes Vazi = MY,,, thus 
Vi = M"V. If e; is the column vector whose coordinates are 1, 0.0,...,0 and 
if (,) is the standard inner product in R%, we deduce that a, = (MPV, e1). It 


124 Chapter 3. Look at the Exponent 


is easy to check that det M equals cg up to a sign. Pick a prime p > 2+ ah 
so M is invertible mod p. Using either Lagrange’s theorem or the pigeon! — 
principle, we can find r > 1 such that M4” Ta (mod p). aes can writ 
M? = Iy+pN for some matrix N with integral coefficients. It ist nen sae 
to prove that for any 0 <j <r -~ 1, the set Aj = {n> Dloga+5 = 0} is ei o 
finite or contains all nonnegative integers. In order to prove this, the point is 


to write 


=. (n\ 4 
Gna+j = (Ta + pN)"MiVo, €1) = (he bk, 
R k=O 


where b, = (N*M?Vo, e1) is a sequence of integers. If A; is infinite, then part 
a) of the problem shows that A; contains all nonnegative integers and the 


a 
result follows. 


Remark 3.11. Under the hypothesis p > 2, the map f is analytic on the whole 
Zp. Vf p = 2, one can still prove (using for instance Amice’s theorem 3.B.30) 
that f is locally analytic on Zp. The proof is then exactly the same as in the 
case p > 2. 


Remark 3.12. The result holds for sequences with values in any field of ches: 
acteristic 0, as Lech proved. The key point is that we have a p-adic version 
of the Lefschetz principle (the proof is not easy, but elementary, see [13] 
[45]): if S is a finite subset of a field K which is finitely generated kaw a 
then for infinitely many primes p there is an embedding of K into Q, send 
all elements of S to Z,. Applied to the roots of the characteristic ea 
of the recurrence relation, this reduces the proof to the p-adic case, which has 
already been discussed. 


Remark 3.13. The result: does not hold for fields of positive characteristic. 

For instance, the sequence dn = (1+ #)" - 1— t is linearly LecurSIS _ 

values in F,((¢)), but the reader can easily check that it vanishes precise y at 

{pn > 0}, which is not the union of a finite set and finitely many arithmetic 
= OF, 

progressions. 
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3.7 Miscellaneous problems 


Before attacking the following problem, let us recall aclassical property 
of the Fermat, number F, = 22" +1, namely that any prime factor p of F, 
satisfies p = 1 (mod 2°*1). Indeed, if p divides F,,, then p divides 22°77 — 1, 
but does not divide 2?” ~ 1. Thus the order of 2 mod pis 2°+1 and since this 
order divides p—1, the result follows. The next problem is a generalization of 
some very classical problems, such as the following one: find all integers n such 
that the congruence zy = 1 (mod n) implies the congruence z = y (mod n). 
It. is easy to see that this is equivalent to nlx? — 1 for any 2 relatively prime 
to n and that this happens if and only if n divides 24. The next problem is a 
bit trickier. 


25. Let m be an integer greater than 1. Suppose that a positive integer n 
satisfies nia” — 1 for all integers a relatively prime to n. Prove that 
nS 4m(2”" — 1) and find all cases of equality. 


Gabriel Dospinescu, Marian Andronache, Romanian TST 2004 


Proof. Write n = 2*r with r odd, By the Chinese Remainder Theorem, there 
is an a with a = 3 (mod 2") and a = 2 (mod r). Such an a is clearly relatively 
prime to n, so nja™ —1. Hence 2 = @™ = | (mod r) and 3" = a™ = 1 
(mod 2*). Thus r|2” — 1 and 2*\3™ — 1. One easily checks that (3" — 1) = 
2 + ve(m), hence 2*l4m. We conclude that nl4m(2™ — 1) and in particular 
n < 4m(Q™ — 1). 

Now suppose equality holds. The above argument actually shows that 
nm <4. Qe). (9™ _ 1) g0 for equality to hold we must have m = 27( that 
is, m must be a power of 2. Suppose m = 2°, Then 


om 242 (2 ~ 1} 2?.3-5... (2! 41), 


that is, » is a power of 2 times some number of consecutive Fermat numbers, 
The cases s = 1 and s = 2 give two equality cases (m,n) = (2,24) and (m,n) = 
(4,240). Suppose now that s > 3, and let p be a prime divisor of 22° +1. 
The preliminary discussion shows that p = 1 (mod 2%), in particular p = 1 
(mod 8), so that by quadratic reciprocity, 2 is a square mod p. The Chinese 


126 : Chapter 3. Look at the Exponent 


Remainder Theorem gives an a relatively prime to n with a? = 2 (mod p). 
Then nja™ — 1, so 22 = @?* = a@™ = 1 (mod p). But this contradicts the 
fact that 2?" = —1 (mod p). Thus there are no solutions with s > 3. mt 


Here is a beautiful problem, for which we present two elementary proofs, 
both ingenious, neither quite natural. In the addendum to this chapter, we 
will present a more natural proof which uses p-adic analysis. 


26. Let x, be the exponent of 2 in the prime factorization of the numerator 
2 g2 gr 


of T + a -+-+-+ —, when written in lowest terms. Prove that 
n 
lim 2y = co 
n=O 


and that xgn > 2" —n+1. 
Adapted from a. Kvant problem 


Proof. The first proof is based on the following nice identity known as Staver’s 
identity, which also appeared as a problem in the USA TST 2000. 


Lemma 3.14. For any positive integer n we have 


n ae n Bac? mie nti 2B. . gntt 
0 i "An get ly" 9 ntl)’ 


Let us accept this lemma for a moment and see how we can conclude. 
Using the extension of v2 to all rational numbers, we can write 


ted ak - 
v2 > E Snti-w(n+ lb — may, ((°)) : 


kok 


Since 
v9 ((")) = 89(j) + s2(n ~ 7) — 8e(n) < 2(1 + logg(n)) —1 


and since v2(n+1) < logo(n+-1), this trivially implies that 2, — 90, Moreover, 
since each of the numbers cae) is odd and since there are an even number 
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of terms in the sum )77_97 axl, it follows that ayy > 2" —~ N44, ending 


the proof of the second part of "the problem. 

Now, let us prove the lemma. We will present two completely different 
approaches, the first one very down-to-earth, the second one more exotic, but 
more powerful also. 

Denote the left-hand side by a, and the right-hand side by by. Clearly, 
Qg = bp = 1. Note that 


+1 2 2 ig + 
bn = (G4 tot) ea Pet: 
n 2n 


Qn 2 LL F 
On the other hand, 


n+ 1 


“Sq nt l=it- 
Las 1S 4D + (n= 9) ia -1 4)! 
"2 rr n-(n~—I)I 
te G4IMn-1-d! + a(n -9! 
2 = ni 
i % as -1 oo nel ‘a -1 
EO EO 
im] a= 
= Gn. 


‘Taking into account these two relations, it follows by induction that a, = 
by, for all n and the lemma is proved. 
Now, let us turn to the second approach. We will use here the classical 
formula 
alb! 


1 
ta, OM a SE 
f (1 ~ td (a+641)! 


which can be proved separately for each value of a + 6 and then by induction 
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on say @ and integration by parts. We deduce that 


n 


2 1 I Par 
Sara Leroy (1 —1)"7Fat 


k * ksf) 
Lpretl re] axe art 
= Sa ERE a 
(n+ yf oT di 


Making the substitution 2¢ ~ 


s yields 

Lynti _ ¢y _ pyrti 1 ntl fy ntl 
| dinero Ciao) krone : eisai ra) AE 
0 2-1 gtk fy 8 


and noting that the integrand is an even function of s, the last expression is 
also equal to 


SB 


fy f) (Ste ner) 


s 


antl fy 


aol Sf (atat+a-o) as 
ont = Io ‘ 7 


1 on f(et-1 i 
7 gett ee i+] si) ; 


from where the result follows trivially. 


Proof. This second elementary solution actually mimics the p-adic analytic 
proof, without mentioning p-adic numbers. Define the sequence of polynomials 


and F,(X) = Lp(2X ~ X*) ~2L,(X). Since 


F(X) = 21 ~ X)Li,(2X — X?) — 201 (xX) 
_ 2XM(1 ~ (2 — X)") 

=, : 

-2X"(1+ (2— XJ +--+ (2- XP} 


| 
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is of the form X" (ag +a ,X +++ -+@,.1.X""") for some integers a; and moreover 
F,,(0) = 0, we deduce that we can write 


; ag ay _ Gn-t ee 
FLAX = Xr ees Xvi xrel . 
n(X) (arta TOR 
Therefore, since L,(2) = ~}F,(2), we can write 
2 92 Qn Rob otha, 
7t9t ae etter’ 


immediately implying the estimate 
wv cre an re = in (n+k (n+k+1)) 
ea tee ocr mi aay kb ; 
207 2 nf ~ OSk<n-1 at 


As vo(n-+k +1) < logo(n +k +1) < logg(2n), the fact that x, — oo is now 
clear. The other inequality is also trivial using the previous estimates. 


We continue with a rather technical problem. The solution we present 
is long and complicated, but nevertheless rather natural. This problern also 
appeared on the IMO 2007 Shortlist. 


27. Find the exponent of 2 in the prime factorization of the number 
grt gn 
Cm )~ (oe): 
J. Desmong, W.R. Hastings, AMM E 2640 


Proof. Note that 


Com) 2:8 (2411) (24+ 1)(2" +.3).--(Q* 4.2" — 1) 


(2) 0-8 Hp Dey 
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so that we need to compute 


vy (a) (2 + ities a ot =) :)) 
= 1+ u9((2" + 1)(2" +3)---(2" 42" —1)-1-3--: ("—1)). 


Looking at small values of n, we conjecture that for n > 2 
vg((2" + 1)(2" +3) +++ (27+ 2% ~1)-1-3----- (2° ~1)) = 38n-1. 
To prove this, we start by expanding the product 
(2” +.1)(2" + 3)---(2? + 2" — 1) 


as if it were a polynomial in 2”. If we work mod 23”, the only terms that 


count are the first three: if@ = 1-3---++ (2" — 1), then 
gro} ~1 1 
ua yy re rm on 
(2" 4+ 1)(2" +3)---(2% 42% ~1) =a+a-2' > 4 
427? a > pe eee (mod 23"). 


(2¢ + 1)(27 + 1) 


Osta j7<Qr-2_-1 


For simplicity, let 2; = 


a = a 
which combined with the previous observation yields 
vo((2" + 1)(2" +3). (2" 4 2-1) -1-3---6- (2” — 1)) 
QR hed 
= Qn 1+ v2 x Lips +2 s Liz]. 
=0 Osi<jg2n~1—4 
We will assume that n > 2. Then 


gn arta 
v2 S aj] =v | or} Se xy] 2n-], 


i=O 
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so 
2 anti} 
a. a (Sx) -Yiade~ s x? (mod 2+"). 
Osi< json} i=0 
Since 7 
Ry, = — 2% Too, = —2?—2"23 (mod 2?r), 
we obtain 
QPahet 
2, mm t2 DT aay 
i=0 O<i<j<2"-1—1 
gr-hay gre dy 
aw-2 $0 af-2" S* 2 (mod 2"*4). 
i=0 i= 
Finally, we clearly have 
gee a =j 
2” X a? =0 (mod 2"+1) 
and 
greta 
—~ 2°(4" — 1 
MO ape rss?te. 4 (ar- ya! a ) (mod 2"), 
i=0 


because the remainders mod 2” of the numbers 54; are the same as the 
remainders of the numbers 2i + 1. Putting everything together, the result 
follows. 


Remark 3.15. There are incredibly many congruences concerning binomial 
coefficients. A very good exercise for the reader is to prove the following 
theorem of Ljunggren from 1952: for any p > 5 and any integers a and 6, the 
number () ~ (§) is a multiple of p?. A more difficult result, due to Zieve 


(1999) is the fact that ee ‘a ay) ~ (§) is a multiple of p®" for any n > 1 and any 
integers a,b, if p is a prime greater than 3. 
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The last. problem of the chapter is quite technical, but also very beautiful. 
A much weaker result was proposed as problem 3 at the IMO 2008. This kind 
of result is actually very old and classical. 


28. Prove that for any c > 0 there are infinitely many n such that the largest 
prime divisor of n? +1 is greater than en. 


Chebyshev, Nagell 


Proof. Let f(n) be the largest prime factor of P, = (17+1)(2? 41) ---(n? +1). 
We will prove that lim i@) = oc. This is enough to deduce the desired result: 
Te OG 


indeed, choose any ¢ > 0, then for sufficiently large n we have f(n) > cn. By 
definition there exists ky, <n such that f(n)|k2 4-1. Then the largest prime 
factor of od +1 is greater than ck, and we are done (note that ky, ~> oo as 
n+ 00, since f(n) divides k2 +1). 

To prove the result on f(n), we will bound the p-adic valuation of prime 
factors of P,. Let Ap be the set of prime numbers smaller than or equal to 
n and such that p= 1 (mod 4). Any prime factor of P, is in the set A,244- 
The following lemma is the first crucial ingredient: 


Lemma 3.16. a) For any odd prime p we have 


Up( Pr) < log,(n? +1) + 


b} We have that vo(P,) = |"). 


Proof. The second part follows from the fact that the factors of P,, alternate 2 
(mod 8) and 1 (mod 4). To prove a), let n; be the mumber of j € {1,2,..., 2} 
such that p'|j? +1. Clearly 


pee) 


)= Loe! +1) deg Th. 


Note that if p > 2 and p' divides both j?+1 and k?+1, then p’ divides j~k 
or j+k, Indeed, p' divides (j —k)(j +k) and p cannot divide both j~k,j+k 
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as p nee not divide 7,& and p > 2. This being said, if j, is the smallest index 
with p'|j? +1, then all k such that p|k? +1 are of the form 7, + sp* for some 
O<s< ao or of the form ~j; + tp’ for some ia <t< aA We easily 


deduce that nj; < 1+ A when p is odd and the cenilt flies: by adding up 
these estimates. Q 


Using the previous observations, we can write (c; > 0 is an absolute 
constant and x(n) is the number of prime numbers not exceeding n) 


2logn! < log Py 
= Up(Pn) log p + val Py) log 2 


Psi(n) 
BEA, 244 ' 
2 2n 
< s log, (n FY +o a log p + cin 
= 
pS Fla) : 
PEA, 2,4 
log 
<S Blogn- r(f(n)) + 2n s wer heyn. 
PSF (nr) 
PEA 21; 


Now, corollary 3.4.2 shows that a(n) < Coen for all nm and some absolute 
constant ¢2 > 0. The crucial fact is that 


> log log f(n) +0(1), 
p 2 

4lp—1 

PSH (n) 


a nontrivial result that follows from our proof of Dirichlet's theorem (see ad- 
dendum 7.A for more details). Taking this into account and dividing the 
previous inequality by nlogn yields 


gles < Be2f(n) ¥ log f(n) ro(;2.). 


nlogn ~ nlog f(n) logn logn 
Combining this with the fact that limp... ee = 1 easily yields what we 
want here. oO 
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Remark 3.17. Nagell [60] actually proved the following general result: if f € 
ZX] is not a product of linear factors with integer coefficients and if P(r) is the 
largest prime factor of f(1)f(2)--- f(x), then there exists c > 0 (depending on 
f) such that P(x) > ex log x. The proof consists in a refinement of the previous 
method, combined with rather deep estimates in analytic and algebraic number 
theory (more precisely, the prime ideal theorem). Erdés [27] proved that there 
exists c > 0 depending on f such that P(x) > x(logx)°%e!slos. It is a very 
deep theorem of Hooley [41] that there exists c > 0 such that for all x > 2 the 
largest prime factor of The 7 (n? +1) is greater than cxid, The proofs of these 
theorems are very involved. 


3.8 Notes 


We would like to thank the following people for providing solutions to 
some of the problems in this chapter: Xiangyi Huang (problems 8,26), Ofir 
Nachum (problem 10), Fedja Nazarov (problem 8), Richard Stong (problems 
3, 11, 20, 25, 27), Victor Wang (problems 8, 21), Gjergji Zaimi (problems 3, 
4, 25), Alex Zhu (problem 27). 


Addendum 3.A_ Classical Estimates on 
Prime Numbers 


Since we are using quite a lot of estimates about prime numbers in various 
places of this book, gathering these results in one addendum seemed more 
than appropriate. All results here are absolutely classical and go back to the 
beautiful ideas of Chebyshev, who was probably the first person to put some 
order in the chaotic world of prime numbers, These ideas were revisited by 
Erd6és and their exposition is heavily influenced by this “update.” Basically, 
everything follows from some very smart applications of Legendre’s formula 
to middie binomial coefficients. 

For the reader’s convenience, let us recall some standard notation. We let 
a(n) be the number of primes less than or equal to n. If g is a map taking 
positive values when the argument is large enough and if f is any complex- 
valued map, we say that f = O(g) (respectively f = o(g)) if fo is bounded 
(respectively tends to 0) when x ++ oo. The crucial estimate that we will use 
when dealing with the behavior of +(n) is the following easy consequence of 
Legendre’s formula. 


Theorem 3.A.1. For any n > 2, (:%)) divides pit.) and is a multiple 
3] Sn 
of Teg) <pcn?- 


Proof. The second part follows immediately from the identity (note that n = 


[s+ 4) 
aq) I 4 
(5 ( ) ae 


and the fact that Thess] p divides the right-hand side and is relatively 


<pen 
prime to |31!. For the first part, Legendre’s theorem yields 
3 


(Ca) 2 (al - Ee] ). 
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As for all @,b € R we have {a + | — [a] — [6] € {0,1}, all terms in the sum are 
equal to 0 or 1 and all terms for j > log, n are zero. Thus vp ((@p) < [log, mJ 
and the result follows. o 


; : a(n)-logn 
The famous (and deep) prime number theorem asserts that. a(n):log mn con- 


verges to 1 as n + oo. The following result gives a uniform lower bound 
estimate. Of course, it is weaker than the prime number theorem, but it is 
rather amazing that with so few tools it already gives the “correct” lower 
bound. Note that log 2 is approximately 0.69. 


Corollary 3.A.2. For alin > 2 we have 


n 
) > 
6 log 2 iach n(n) 


Proof. Using theorem 3.A.1 for N = (?), we obtain 


log N == pS up(N) logp < s flog, (2)] log p < (2) log(2n). 


psan pS2n 


Next, N is the largest among the (7) and >, je) = 4”, hence N > eo 
meat log N = 2n log 2 — log(2n + 1) 


log(2n) ~ log(2n) 
Using this inequality as well as r(2n +1) > m(2n), a small computation yields 
the lower bound for 2 > 6 as in the original statement; it is easy to check 
directly the cases 2 <n < 5. 

Using theorem 3.A.1 again yields 


; 2n 
6 gt2n)—m(n) Il < <4", 
Th < Pps ( ”) pa 


npS2n 


a(2Qn) > 


which applied to n = 2* gives 


n(ak) a x(a") KS Se x 
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Combined with the obvious inequality (2*+!) < 2, this implies the inequality 
which easily implies that 7(2") < aan Finally, we have 


gitflog, x] 6n 
= See fy ae gO 


< n(2itllesanly < 3. , 
mn) S a ) 1+ floggn] ~ loggn 


Before tackling the next result, we will prove a very elementary, yet pow- 
erful inequality due to Erdés: 


Theorem 3.A.3. For any n > 2 we have Tp<n p<4r-), 


Proof. The proof is by strong induction, the inequality being clear for n = 2. 
Suppose that it holds for all numbers smaller than or equal to nm and let us 
prove that To<nta p< 4". Ifn +1 is even, this is clear, so suppose that 
n == 2k. Note that Theercpcor 4.2 P divides Ce 1) so it is certainly at most 
equal to (74*"). Combining this with the induction hypothesis for k gives 


Qh+1 
I »= I] > Il p< ( k \ se 
pened pok+) R&B pcak+] 


the last inequality being a consequence of 


ae 2k+1 Qk+4 Qk+1 
AR = aed, Pree ‘aa 
2-4" =(141) +( k \+Gnt)+ > 2 k ). a 


A slightly trickier argument yields the following beautiful theorem of Mertens. 


Theorem 3.4.4. We have 


3 wee =logn + O(1). 


Pr 
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Proof. We will use the prime factorization of n!. Legendre’s formula yields 
n logn n 
meeLmgred Ur | (nl) << ——. 
p-l ( — i) on onl) < on7 
Multiplying this by logp and summing over p < 7 yields 
log p 
— log J[e- a(n) -logn < logn! ~ aS) eat <0. 


pan pon 


Using Erdés’ inequality [],,<,,p < 4", the previous estimates on (7), and the 
inequalities nlogn > logn! > n(logn— 1) (the first one is obvious, the second 
one follows easily by induction using the inequality log (1 + 4) < 4) yields 


psn 


The theorem follows from this estimate and the fact that the series 7, aoe 


converges (since ney < aE if p is large enough), o 


The next result. is a simple application of Abel’s summation and of the 
previous theorem. 


Theorem 3.A.5. We have 


S- J 2 log logn + O(1). 
P 


psn 


Proof. Define a, = logn if n is a prime and 0 otherwise. By the previous 
theorem there is a bounded sequence r, such that 


Sp = GQ +--+ + On = logn+ tn. 


Thus 


sri ash Safer By, is), 
p logk ne ioe log(k + 1) 


kd 
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The triangle inequality yields 


eS 1 
vn (ago oath isaleTH) = O(1), 


8o it remains to prove that 


nw 
log k 
> ( - rater) = loglogn + O(1). 


k=2 
Note that for & > 2 


_ logk _ j Kop dt 
log(k +1) cari, i 


k+1 
o< [ dt 1_ log k& 
» tlogt log(k + 1) 


_ [rs (log(k + 1) — log t) dé 
ke tlog t log( (E41) 


Hence 


1 
<o ayer 
~ k*(log 2)?" 


where we have used 
log(k + 1) — logt < log(k + 1) ~ log(k) = log(1 + 1/k) < 1/k. 
Since the sum of the upper bounds converges, we have 


= logk ng 
> (: = ater) = y OU) milon loge OU). 


P=) tlogt 


Remark 3.4.6. One can actually say much more and the true content of 
Mertens’ theorems is the following: there is a constant ¢ such that 


i 
5 = oslogn + c+0( 1 ) 
Pp 


jee logn 
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and if 7 = limpsoo (1+ 3+::>+ 4 ~logn), then 


I] ( ie *) ~ logn a (ar: ; 


Por 


The first. estimate is not difficult and uses again Abel’s summation formula 
combined with basic integral calculus. The second estimate is much trickier. 


Addendum 3.B_ An Introduction to 
p-adic Numbers 


This rather long addendum is an introduction to the wonderful theory of 
p-adic numbers and their applications. This is a vast subject and the literature 
concerning it is huge, so that we cannot even properly scratch its surface. 
However, even a glimpse into the subject reveals amazing things... 


For a variety of reasons, reduction mod pis awfully insufficient. The best 
way to reduce modulo arbitrary powers of a prime p while still working in 
& reasonable algebraic (and especially analytic) context is using p-adic num- 
bers. Very roughly, a p-adic number is a kind of “analytic function of p” 
with coefficients taken mod p. So, p-adic numbers will be infinite expansions 
ps hoS—08 ayp*, where a are integers between 0 and p— 1. This is a mixture 
of the classical idea of decimal expansion and the more analytic Taylor expan- 
sion of a nicely behaved complex-valued function around 0. The idea of seeing 
integers as analytic functions of primes is incredibly powerful and appears all 
the time in rnodern number theory. 


Though the best way to define the field of p-adic numbers is by completing 
Q with respect to the p-adic absolute value, we will introduce p-adic numbers 
algebraically and then develop their analytic properties. We think that this 
is a bit easier to digest. Next, we briefly discuss what happens when one 
takes a finite extension of the field of p-adic numbers, which gives us the 
opportunity to discuss valuations and absolute values from a more abstract 
(and useful) point of view. This discussion reveals a huge complete extension 
of Q,, denoted C, and called the field of complex p-adic numbers. Its mere 
existence has some amazing consequences, for instance a beautiful geometric 
result of Monsky, concerning tiling of squares by triangles of the same area. 


After discussing classical analogues of the exponential and logarithm 
maps, we focus on the p-adic analogue of the complex Gamma function, This 
is the most technical part of the addendum, but also the most rewarding. It 
requires a preliminary discussion of Mahler expansions and p-adic integration, 
which are rather complicated, but once the machine is sufficiently developed 
one can prove deep congruences in a quite straightforward way. We highly 
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recommend the wonderful books [13], [49] and [67] for a much more thorough 
treatment and many good examples. 


3.B.1 Arithmetic of p-adic integers and p-adic numbers 


A nice way (though maybe not the most intuitive) to see a p-adic integer 
js to understand it as a compatible system of residue classes mod p”, for all 
n. That is, following Hensel, a p-adic integer is a sequence (Z,)n>1, where 
each %, is a residue class mod p”, say the class of an integer a, and where 
Gni1 = Gy, (mod p") for all n. This simply says that Z,41 maps to Ty under 
the natural map Z/p"*?Z + Z/p"Z. With this description, it is fairly clear 
that the set of p-adic integers becomes a ring, if we define the addition and 
multiplication component-wise (i.e. the sum of the sequences (En)n and (Jn}n 
is declared to be the sequence (€_ + Yn)n, similarly for multiplication). To 
avoid useless repetitions, let us give a name to such sequences: 


Definition 3.B.1. A sequence (En)n>i: En € Z/p"Z is called compatible if 
Inti = Tp (mod p") for all n, where zp € Z is any lifting of Zr. Let Zp be the 
ring (for the previously defined operations) of all compatible sequences and 
call 3t the ring of p-adic integers. 


Note that Z lives inside Zp, as any integer n can be thought of as the 
compatible sequence (n (mod p*)),. The map sending n to this sequence 
gives an injective ring morphism from Z to Zp. We always identify an integer 
and its image in Zp. 

It turns out that our new ring Zp, has very nice properties, both alge- 
braically and topologically, making it by far easier to handle than Z (you 
might have not noticed, but Z is a very complicated object, actually. ..}. We 
discussed quite a lot the p-adic valuation for integers and rational numbers 
and we will see that it naturally extends to Zp, making Zp a beautiful place to 
do analysis. However, we need some preliminaries in order to make this dream 
reality. The following result is crucial for the arithmetic of Z, and shows the 
big difference between Z and Z, as far as arithmetic is concerned. Recall that 
a unit in a ring is an element that has a multiplicative inverse in that ring. 
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. a, 
T nem 3.B.2. Any nonzero p-adic integer x can be uniquely written 
a= p"u for some nonnegative integer k and some unit u. 


Proof. Before going on to the proof, let us characterize the units of Zy. This 
will also play an important role in the proof of the theorem: 


Lemma 3.B.3, A compatible sequence (xn) defines a unit in Zp if and only 
if its first component is nonzero. 


Proof. One direction being obvious, let us assume that the first component is 
nonzero. By compatibility, all z, are relatively prime to p, thus their classes 
mod p” are invertible. Simply choose yp to be the inverse of x, mod p” 
and check that it forms a compatible sequence, which is the inverse of x by 
construction. oO 


To prove uniqueness of the representation given in the theorem, we need 
the following easy 


Lemma 3.8.4. If € Z, and p™« =0 thenz =0. 


Proof. By induction on m we may assume that m = 1. Next, write z as a 
compatible sequence (#,), and observe that the condition px = 0 simply says 
that pz, = 0 in Z/p"Z. This means that p’~!|a, for each n > 1. But since 
p” divides xn41 ~ tp, we see necessarily that p"{zp, so in fact all components 
of x are zero. im 


Assume now that x = p*u = p'v for some u,v units and some nonnegative 
integers k,l. If k > 1, lemma 3.B.4 yields p*—'u = v. As v is invertible (in 
other words a unit), we have a contradiction with lemma 3.B.3. Similarly, we 
cannot have k <1,so0 k= 1. Applying lemma 3.B.4 once more, we get re uv, 
which proves the uniqueness part of the theorem. 

To prove the existence, write ¢ as a compatible sequence and let m be 
the largest. integer j such that x; = 0 (mod p’). Then y, = “242 are integers, 
since by compatibility tn4m = £m = 0 (mod p”), Moreover, since x, is 
compatible, so is y,. Then by construction (and the compatibility of (Z,)n), 
the sequence y, defines a p-adic integer y such that p”y = x. We claim that 
y is a unit, which will finish the proof of the first part of the theorem, But 
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the first component of y,, does not vanish, so the result, follows from lemma 
3.B.3. Q 


Here is an important consequence of the theorem: 
Corollary 3,B.5. Zp is an integral domain. In other words, if ab = 0 then 


a=O0 orb=0. 


Proof. This follows immediately from the previous theorern and the second 
lemma in its proof, O 


If R is an integral domain, one can define its field of fractions: formally, 
it is the set of all symbols ¢ with a,b € R and } ¢ 0, it being understood 
that we identify $ and § if ad = bc. Addition and multiplication of fractions 
being done as in elementary school, this yields a field. Applying this to Zp, 
we obtain the field of p-adic numbers. 


of 


the second equality being a consequence of theorem 3.B.2. 


a,b € Zp,b# 0} = Fla E Zpy,n € N}, 


3.B.2 The padic valuation revisited 


We will give a more analytic flavor to Q,, by endowing it with an absolute 
value, which plays the same role as the usual absolute value on real numbers. 


Definition 3.B.6. Let « € Q, — {0} and write (according to theorem 3.B.2) 
a = p*u for a unique unit u and a unique integer k. Call k = up(wx) the p- 
adic valuation of a and |al, = p7’) the p-adic absolute value of x. Define 
Ol, = 0. 


The following is an immediate consequence of the definition: 


Proposition 3.B.7. For all x,y € Qp we have 


ltylp = lalp:|ylp and |x + ylp < max(|elp, lulp), 


with equality if |z|p # |ylp- Moreover, |-|p extends the p-adic absolute value 


on QC Qp. 
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Note that the inequality |z + y/p < max({z|p, |ylp) satisfied by the p- 
adic absolute value is stronger than the usual triangle inequality for real or 
complex numbers. This has a whole variety of consequences, which make p- 
adic numbers a rather exotic object from a geometric point of view. However, 
this will not affect us, since we will deal with analytic aspects and for that it is 
enough to introduce a distance on Q,, a measure of how close numbers are to 
one another. The distance between x,y € Q, is defined by d(x, y) = |x — ylp. 
We can then define analytic objects as in the “real world” (i.e. in the world of 
real numbers, but we will leave it to you to decide whether that is really the real 
world...). For instance, there are obvious notions of convergent sequences, 
continuous functions, etc. Basically, any real-analytic object has a p-adic 
counterpart. Just to see how it works, let us give the definition of a convergent 
sequence: 


Definition 3.B.8. Say a sequence of p-adic numbers ¢, converges to a p-adic 
number a if |, ~ @lp converges to 0 in the usual sense, that is for all N > 1 
there is no such that lap, ~ aly < 1/N for all n > no. 


Intuitively, the sequence x, converges to a if the difference x, ~ a is more 
and more divisible by p when n is large, that is if vp(x, — @) goes to infinity 
as n -+ oo. If you think of a p-adic number as a compatible sequence, this 
means that for any &, if n is large enough (depending on &) then the first k 
components of zp, —4 are zero. The following result is absolutely fundamental: 


Theorem 3,B.9. Ifa, € Q, converges to 0 then the series Saco Cn converges 
in Q,. That is, the sequence whose general term is ry +a, +---+2p, COmVETgES 


in Qy. 


Note that this is NOT true for real numbers (think about the harmonic 
series!). Also, note the following important consequence: a sequence Zp, € Q, 
converges if and only if ry, ~ 2p, tends to 0 in Qy, a fact that will be used a 
lot in subsequent sections. 


Proof. Write sp = to+21-+++++4n, 90 that s, ~s,..1 goes to 0. Note that we 
may assume that all x, are p-adic integers: indeed, since z, goes to 0, tp is a 
pradic integer for n large enough. Multiplying all 2, by the same large power 
of p so that the first terms also become p-adic integers does not affect the 
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hypothesis or the conclusion. Next, write s; = (81, 3i2,...) as a compatible 
sequence. Thinking of these sequences as infinite rows of some infinite matrix, 
the crucial fact is the following: 


Lemma 3.B.10, For any there exists kp such that Bin, = 8j, for alli, 7 > kn. 
That is, every column of this infinite matrix eventually becomes constant. 


Proof. Indeed, note that for i > 7 we have 
Up (8; ~ 8j) = Uplajaa +++ + Bi) > pnt Up(Tk) 


and the last quantity goes to infinity as 7 -+ oo. Thus for i > 7 large enough 


we have up(s; — s;) > n, which implies that 3jn = 8jn. a 


This lemma gives us a candidate for the limit of the sequence s,: define 
the sequence a = (G1, G,...), where Gy, is the common value of the elements 
Sm for i large enough (using the notations of the lemma we have @, = Si.n)- 
It is then easy to check that this sequence is compatible and defines a p-adic 
integer which is the limit of the sequence sp. Oo 


The following basic result will be used frequently below. 


Proposition 3.B.11. Zp is a compact subset of Qp- 


Proof. Consider any sequence 2, of elements of Zp. Since the first component 
of x, (seen as a compatible sequence) takes only finitely many values, there 
exists a subsequence r,,(n) and an integer a; such that o,,(n) = a; (mod 2) 
for all n. The same argument yields a subsequence a, (59(n)) and an integer 
az such that £y,(.{n)) = a2 (mod p’) for all n, etc. Considering 


elm) = wr (ya(--en(m))-++)s 
we obtain a subsequence such that 2,(,) = @, (mod p*) for alln > k. It 


follows that (a, (mod p*)), is a compatible sequence, defining a p-adic integer 
a. By construction, we have limysoo Zin) = @ and the result follows. 0 
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Finally, let us give another fundamental property of p-adic integers, which 
shows that they are basically “formal power series in p” or “infinite base-p 
expansions.” 


Theorem 3.B.12. For any p-adic integer x there exists a unique sequence 
an € {0,1,...,p~ 1} such that 


oo 
go Sapp”. 
n=0 


By definition, this equality means that the sequence whose general term is 
ag aip+++:+anp" converges tox. Moreover, if an is the first nonzero term 
of this sequence, then vp(x 


Proof. If x is a p-adic integer, there exists a unique ag € {O,1,...,p— 1} 
such that «—~ag € pZpy. Indeed, it is clear that ag has to be (the lifting to 
{0,1....,9 ~ 1} of) the first term of the compatible sequence z. Using this 
remark, we deduce by induction that for any n there are unique @, @1,...,G, € 
{0,1,...,p—1} such that #~(ap-+ayp+---+aqp") € p""Z,. But this implies 
that ¢ = limn.oo(ay + app tes + ap"). The rest is essentially immediate 
using lemma 3.B.4 and theorem 3.B.3. 0 


So any p-adic number x can be uniquely written as a Laurent series 
z= 5 P*ax 
k>eN 


for some N and some a, € {0,1,...,p— 1}. Moreover, we have the following 
nice criterion to establish when « € Q. The proof is a bit tricky. 
Proposition 3.B.13. The p-adic number 


k>—N 


%3 a rational number if and only if the sequence (ay), becomes periodic from a 
certain point. 
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Proof. It is immediate to check that if (44), is eventually periodic, then x is 
rational (simply because p* + p°* +...) = oe in Q, for any a > 0). The 
amusing point is proving the converse. Multiplying x by a power of p, we may 
assume that x € Zy, say © = Yheo agp’. Write 2 = ® for some relatively 
prime integers u and v and consider the sequence x, = yee asp’ —*. Then 
clearly x, = ay + prgy1. AS Zo = @ is rational, it is clear that all xz are 
rational. But much more is true: we claim that we can find y, € Z such 
that y;,| < max(|u|,|v}) (using the ordinary absolute value!) and 2, = %. 
Indeed, if this holds for x, then we can take y,.; = 4—"** (clearly Iyeval < 
max(|z|, v]); to see that yxs1 € Z, note that ay ~ ay € pZp, so p must divide 
Ye ~ vax). Now, the sequence (yp )x is a bounded sequence of integers, so we 
can find i < j such that y; = y;. Then 2; = aj and by uniqueness (provec 
in the previous theorem) we must have @j44 = @j41,4i49 = @jaa,.... This 
finishes the proof. 


The following is also absolutely crucial. It basically says that in many 
cases solving a polynomial equation in p-adic numbers is the same as solving 
it mod p, since any solution mod p will automatically lift to a compatible 
sequence of solutions mod p". 


Theorem 3.B,14. (Hensel’s lemma) Let f € Zp[X| and let a © Zy be such 
that |f(a)|p <1 and |f'(a)lp = 1. Then there exists unique b © Z, such that 
f(b) = 0 and |b-alp <1. 


Proof. We will prove by induction that one can find a sequence of p-adic 
integers a, with Gp = @,4n41 = %& (mod p"*?) and vp(f(an)) 2 n+ 1. By 
the previous theorem, the sequence a», will converge to a p-adic integer b and 
since vp(f(an)) > n+ 1 and f(an) converges to f(b), then f(b) = 0. To prove 
the existence of a sequence a@,, assume we constructed ag,...,@, and search 
for dng = Gn + pt", for some p-adic integer 5,. We need to ensure that 
Flan +p"*!b,) = 0 (mod p"**). Since 


L(@n +2" bn) = flan) +p’ bp flan) (mod p***), 


it is enough to take by, such that f(@,)+p"* "bn f/(en) =O (mod pt), which is 
possible as f’(a,) is a unit (because a, = ap (mod p) and f'(ap) is a unit). 


3.B, An Introduction to p-adic Numbers 149 


3.B.3 Absolute values and their extensions 
Definitions and Ostrowski’s theorem 


Let us start with an easy observation: @, is not algebraically closed. 
Indeed, the equation x? = p has no solution in Q,, since if x € Q, satisfies 
a? = p, then p~! = [pl, = [x], = |e]; and |2|p is of the form p~¢ for an integer 
a, & contradiction. Thus, it is meaningful to study finite extensions of Q,, as 
one is often interested in solving polynomial equations over @,. It turns out 
that all finite extensions of Q» also have natural absolute values that extend 
the absolute value of Qp, though this is not trivial at all. It is thus better to 
abstract the situation, using the following 


Definition 3.B.15. 1. An absolute value on a field K is a map |-|: .K ~+ 
R* such that |e} = 0 if and only if « = 0, |ayl = |x| -ly| and lz+y] < 
|| + |yl. The absolute value is called non-archimedean if le+yl < 
max(/z/,|y|). The absolute value is called trivial if |x| = 1 whenever 
«#0. 


2. Two absolute values are called equivalent. if there exists c > 0 such that 
ele = falf. 


3. A valuation on a field K is a map v : K -+ RU {oc} such that v(t) = 00 
ifand only if 2 = 0, (ey) = u(x) + v(y) and ole + y) 2 min(v(x), v{y)). 


It is clear that if |-| is an absolute value, then w(x) = — log |x} defines a 
valuation. If |-| is a non-archimedean absolute value on K’ , the ring of integers 
of K is by definition Og = {x € K|[z| < 1}. It is easy to check that this is 
a ring and that mx = {x € K|lx! < 1} is a maximal ideal of Ox. Thus 
ke = Ox/mx is a field, called the residue field of K’. 

It is clear that any non-archimedean absolute value is bounded by Lon 
4%, but the nice and somewhat tricky fact is that the converse holds. Indeed, 
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if jn| <1 for all n, then for all x,y and all n we can write 


< Soha} lyt-® < (n+ 1) max(|e), fyl)”. 
k=O 


Taking the nth root of this inequality and letting n — 00 yields [x + y| < 
max({jz},|y|), proving the claim. With these remarks being made, we are ready 
to prove the following beautiful result: 


Theorem 3.B.16. (Ostrowski) Any nontrivial norm on Q is equivalent to the 
p-adic absolute value for some prime p or to the usual absolute value. 


Proof. Suppose first that the absolute value | -| is non-archimedean. We know 
that |jmn/n| = |m|/in| A 1 for some nonzero m,n € Z. Without loss of gener- 
ality, then, let |m| < 1, so that for some prime p|m we have |p| < 1. If q is 
another prime with |q| < 1, then we may find integers a and b with ap+bg = 1, 
and then 


1 = |1| = lap + bq| < max{lapl, jbgj} < max(|p|, |g) < 1, 


a contradiction. We conclude that. | 
the p-adic absolute value. 

The difficult case is when | -| is archimedean. We saw that in this case 
there is an integer n > 1 such that [n| > 1, Pick any such n and write for 
all « > 1 the number « in base n, say x oF aynt-+++a,n*. Note that 
& < log, © and that if Cy, = maxy<j<n—;[j|, then 


: |p|’, and so |- | is equivalent to 


infe* 


In| —1 


la] < Jato) + |ei||n| +--+ largl|nl* < Cn < Aga Ii 


for some constant A, independent of x. Applying this to x’ for N large enough 
yields |x| < aloes lnl, 
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Now, we claim that for any integer z > 1 we have lal > 1. Indeed, if 
ja] < 1, by writing nd in base x and using the same argument as before, we 
deduce that 

[nl = |r| S$ CU + log, n’). 
As {n| > 1, this is certainly not true for 7 large enough, proving the claim. 

We have, therefore, that for all #,n > 1 both |x| < gta lPl and Ini < 
nies: ltl, so that (e.g.) the first inequality is in fact an equality. This implies 
that log,, |r| is a constant function of m > 1. Thus, there is d such that. |n| = n4 
for all integers n > 1 and the result follows. iia 


Extensions of absolute values 

We prove now the following fundamental and nontrivial theorem. The 
proof is pretty acrobatic and uses a nice mixture of analytic and algebraic 
arguments. 
Theorem 3.B.17. Let K be a finite extension of Qp. There is a unique 
extension of the absolute value on Q, to an absolute value on K. This absolute 
value is non archimedean and it is given by 


jal = ¥ NK, (lp 


ife € K, where n =([K + Qy] and Nxjg, @ the norm. 


Proof. We prove uniqueness first. We claim that for any two absolute values 
{-|a.[+ [2 on K that extend |- lp on Qp, one can find ¢1,c2 > 0 such that 
ealaly < jal2 < calzl; for all «. Assume that this happens for a moment. Then 


eyjelt = ale" < elb < ealelf, 
and taking nth roots and letting 2 —> oc we get jvl, = |a]2, proving the 


uniqueness part. Now, to prove the claim, it is enough to prove the following: 
if €1,€2,...,€n is a basis of K over Q, and if we define 


Sra = max |#ilp, 


: | i<ign 
{i=l a0 
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then there are ¢1,¢2 > 0 such that ¢|alo < |elr < calatloc. But clearly for 


we have 


n 
lair < SY leap ‘leh < 


i=l 


so we can take cy = 7 /e,];. Obtaining c, is more subtle. Let 


S=freé Kila 


wo = I}. 


If we equip K with the product topology induced from its vector-space struc- 
ture over Q,, then S is easily seen to be a bounded, closed subset of A, whence 
compact. Moreover the map z — {al is continuous on S, as say 


leh = |ylal S lz ~ yls S eal — loo. 


Because this map does not vanish on S, there is c; > 0 such that {a}, > ¢; for 
xéS, Asany x € K can be scaled by an element of Q, to become an element. 
of S, the claim is proved. 

Existence is harder. Defining |r| = / | 


K 1Qp(®)ips standard properties 
of the norm yield |ay| = |a| - lyl, jal = 0 2 = 0 and [z| = jalp for c € Qp. 
The difficult point is proving that |x + y| < max(|z],/y|). By multiplicativity, 
it is enough to prove that |z + 1| < max(1, |z|), which reduces to 


if jz] <1, then |x + 1) < 1. 


‘This is however quite subtle. We will actually prove the following result: there 
exists c > 0 such that |@ + 1| < ¢ whenever |z| < 1. Assume that we proved 
this for a moment. Applying it to z/y or y/« (according to whether |y| > [2| 
or not}, we deduce that for all x,y we have |x +y} < emax({a},|y|). But then 


for all d we have 
ered 
a (= y 


yl). 


1g @ = Ig dco < c(max{|z 
je + yl (@+y)"| $e max < e(max(|z|, 
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Taking dth roots and letting d -+ oo, we obtain |a+y| < max((], |y|), finishing 
the proof. 

It remains to prove the existence of c. This is similar to the first part 
of the proof. Namely, let ¢),...,¢n be a basis of K over Q, and let | - |. be 
as above. As the norm of an element of Q, is a polynomial expression of the 
coordinates of that element in the basis e1,...,€n, it follows that 2 > [2] is 
a continuous map. Since it does not vanish on the compact set {2|[z|.. = 1}, 
there are positive numbers a, such that @ < |z| < 6 whenever |x|, = 1. An 
obvious scaling argument implies that ala|o. < |z| < bla. for all z, from 
where 


Hea) <blt+zln <6 (: + ie) <bt+ g 
a a 


whenever |x] < 1. ‘The existence of c is thus proved and we are done. a 


The uniqueness property in the previous theorem ensures that if K C L 
are two finite extensions of Q,, then the restriction to K’ of the unique absolute 
value on L extending that on Q, is the unique absolute value on K extending 
that on Q,. This implies the following very useful 


Corollary 3.B.18. Fiz an algebraic closure Q, of Qp. There is a unique 
extension of |-|p to a non archimedean absolute value on Q,. 


From Q, to C, 


We have a bad news: after all the hard work in the previous section, we 
have to tell you that Q isnot a very good object. When dealing with p-adic 
numbers, analysis is intensively used and finite extensions of Q, are very good 
places to do analysis because they are complete. This meaus that all Cauchy 
sequences converge in such a finite extension (this also happens in R or C 
or in @ compact interval, but not in (0,1) for instance: the sequence I/n is 
Cauchy, but does not converge to an element of (0,1)). On the other hand, 
Q, is not complete (this is not really easy to prove, actually, but those with a 
good analytic background will observe that: it follows inunediately from Baire’s 
lemma), so one cannot do reasonable analysis on this field. 
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Let us explain why finite extensions of Q, are complete, since this is very 
important. The fact that Q, is complete was essentially proved while proving 
theorem 3.B.9. To see that a finite extension K is complete for the unique 
absolute value | - | extending | «|p, choose a basis e1,...,en, of AK over Q, 
and define [aloo = max;|a;| if x = S70, a;e; and 2; € Q,. This is a norm 
on K (but not an absolute value) and the same argument as in the first 
paragraph of the proof theorem 3.B.17 shows that there exist ¢;,c2 > 0 such 
that cz] < [z|sq < ¢g/z| for all z. Thus, the notions of Cauchy sequence and 
convergent sequence are the same for |-| and |-|.,. But it is clear (from the 
fact that Q, is complete) that K is complete for | - |, thus K is complete for 
|-|, too. 

Now, we would like to have a field that contains Q@ and which is still 
complete. [t turns out that there exists such a field which is moreover minimal. 
The situation is very similar to that of Q endowed with the p-adic absolute 
value: it is not complete for this absolute value, but adding all possible limits 
of all Cauchy sequences in Q one ends up with a much bigger field, Q,. One 
can play the same game starting with Q, and one ends up with a huge field 
Cp, endowed with an absolute value |-|, extending that on Q, and having the 
properties: 


1) The field C, is complete with respect to |-|p, that is any Cauchy sequence 
in C, (with respect to | - |p) converges in C,. Just as in theorem 3.B.9, 
we deduce that if a, € C, is a sequence converging to 0, then 0.) an 
converges in Cp (the notion of convergence is defined just. as for Q,). 


2) The field C, is algebraically closed, in particular it contains GQ, More- 
over, G, is dense in Cy. 


3) The residue field of C, is an algebraic closure of Fy. 


Here is the way one constructs C,: consider the set C of all Cauchy sequences 
in G,. It is easy to check that this is actually a ring, addition and muitipli- 
cation being defined component-wise. Next, one checks that the subset m of 
C consisting of sequences that converge to 0 is a maximal ideal of C. One 
defines C, = C/m. By definition, this is a field. It has a natural absolute 
value |-|,, defined by: if z € Cy, is the class of a Cauchy sequence (ay),, then 


erent 
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|elp = litnoo |@nlp. One checks that this is well-defined (ie. independent of 
the choice of the sequence (a,),) and it is an absolute value that extends the 
one on Qp. It is not difficult to prove that C, is complete for this absolute 
value and essentially by definition Q, is dense in Cy. It is more difficult to 
prove that C, is an algebraically closed field. 

The previous construction does not use any special property of @, except. 
the fact that it has an absolute value. In general, for any field K endowed 
with an absolute value, one can construct (in exactly the same way as above) 
a field K that contains K and has an absolute value |-| extending that of K’, 
such that K is dense in K . This field is called the completion of K. If K is 
algebraically closed, then K is also algebraically closed, though this is not so 
easy to prove. 


A summary 


The upshot of this technical section is that finite extensions of Q, behave 
as Q, both algebraically and analytically, that @, is a pretty bad field from 
the analytic point of view and that if one wants to do analysis, then one has to 
work with its completion C,. Moreover, in Cy a series ee Qy converges if and 
only if a, converges to 0 (which means that limps. lanjp = 0) and then one 
can permute its terms as one wishes and still get the sare value of the sum 
(this is definitely wrong in real or complex analysis!). Finally, one can deal 
with double sums in a rather leisurely way, since if littnax(m,n)—roo Gmn = 0, 


me 


and all series converge. Note that we did not prove the last two assertions, 
but. since they are rather easy, we leave them as good exercises for the reader. 


3.B.4 p-adic analogues of classical functions 


Recall that for any complex number t; the series )>,, >0 a converges to a 
complex number called e® and x e” is a surjective group morphism C -» C*. 
Let us study the p-adic analogue of this construction: the problem is that 
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p(n!) is quite large, so we cannot expect that the previous series converges 
for all «, Actually, by theorem 3.B.9 the previous series converges for some 
a € Cy if and only if vp (=) —> 00. Using Legendre’s formula v(m!) = ares 

where sp(n) = O(logn) is the sum of digits of n when written in base p, we 


deduce that the series converges iff 


' 1 S{n) 
im, n (vot - =) +- T= 


which happens if and only if vp(%) > ap i.e. fal <p rs, Moreover, one can 
easily check (using the remark on double sums made in the previous section) 
that. if z,y satisfy these conditions, then so does x + y and e* - e¥ = e* T¥, 

Tt turns out that one can construct an inverse to the exponential map, 
which is however defined on all C,. More precisely, we have the following 
nontrivial 


Theorem 3.B.19. There exists a unique continuous homomorphism log, : 
Cy + Cp such that log,(p) = 0 and 
=) 
tog,(2) = Y>(-1y =" 


rm 
nok 


for |x ~ ljp <1. 


Proof. (sketch) The proof is pretty long, so we only give the main steps. The 
erucial point is the following 


Lemma 3.B.20. Any x € Cy can be uniquely written « = p”-¢-v for some 
r&Q, ¢ a root of unity of order prime to p and v € Cy such that jvy—1) <1. 


Proof. Let us prove the existence part. By construction, v,(Cp) = Q, so that 
given any x € Cp there is r € Q and u € Cp such that x = p"-u and v,(u) = 0. 
Consider the image of u in the residue field F, of C,. It is a nonzero element 
of some IF} for some power q of p. Thus vp(ut! — 1) > 0 and then easily 
ul)" 5 1 as n — 00. We see similarly that ¢ = lim,4.. u*" converges and 
then (9-1 = 1 with wp(u—¢) > 0. So one can take v = u/¢. 
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For uniqueness, it is clear that r = vp(a) is uniquely determined. It is 
thus enough to check that no root of unity ¢ #1 of order prime to p satisfies 
j1—¢| <1. If ¢ has order n > 1, then the norm (from Q,(¢) to Q,) of l-¢ 
is a divisor of n, and so cannot be a multiple of p. a) 


Now, let us study log,. Let x € C) and write z = p"-¢-v as in the 
lemma. Note that if we admit that log, exists, then necessarily N log,(¢) = 
log, (¢%) =0if CY = 1, so necessarily log, (¢) = 0. As log,(p) = 0, we must 
have 

+ ae re 
log, # = log,(v) = Sey @riyr 


n>i e 


This shows that. if log,, exists, then it is unique. 
It. is harder to prove existence. First, by the previous paragraph we must 


define 
wed (wv =i) 


n 


log, x = log,(v) = Ss“ 1) 


n>t 
if =p’ -¢-v. Note that the series converges, as 


Up (*) 2 nup(v ~ 1) — log, (n) ~» ov. 


Moreover, since the series converges uniformly, it is easy to see that v —+ 
log,(v) is continuous for |v ~ 1] < 1. From here it is not difficult to check that 
x > log, («) is continuous on C5. It remains to check that it is additive. This 
easily reduces to 


log, (1 + u) + log, (1+ v) = log, (1 + (w+ + uv)) 


for |u| < 1 and |v] < 1, which is the tricky point, First, one checks that as 
formal series in X,Y we have 


log(1 + X) + log(} + Y) = log(1+(X +¥4.XY)), 


for instance by differentiating both sides in X, respectively Y. Next, the series 
defining log, (1 + u), log, (1+) and log, (1 + u+ v + uv) converge absolutely 
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and one can permute their terms as one wants, without changing the value of 
the series. This implies that we can substitute ¥ = u and Y = v in the formal 
series equality and finishes the proof of the theorem. i] 


With the same arguments as in the proof of the previous theorem we 
obtain log,(e”) = a if lz] < p Pi {it is easy to check that Je™ ~ I], < 1 for 
such x) and eS) = g if x is close enough to 1 so that up(log,(z)) > pi 

We end this section with another useful p-adic analogue, the binomial 
functions and power fanctions. Define, for x € Q, and nz >O 


Proposition 3.B.21. 1) (Vandermonde’s identity) If x,y € Qp, then 


Cree) ey: 


2) Ife € Zp, then (*) € Zp for all n. 
3) Ifae Cy satisfies lalp <1 and « € Zy, define 


(l+a)? = x Ger 


Then the series converges and x -> (1+ a)" is a continuous additive 
homomorphism from Zp to Cp. 


Proof. 1) lf x,y are positive integers, simply compare coefficients in 
(A+ 7)" = (1 +T)*- +7). 


The result then follows by density and continuity. The same argument works 
for 2). The convergence of the series in 3) follows immediately from 2) and 
theorem 3.B.9. The continuity follows from the uniform convergence of the 
series, while the equality (1 + a)* - (1 + 4)¥ = (1+ a)**® follows either by a 
simple computation using 1) or from the case z,y € {1,2,...} by continuity 
and density. 0 
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Some applications 


We discuss here some immediate applications of the preceding theoret- 
ical results. The reader will probably appreciate better the power of p-adic 
techniques, since none of the following problems are easy to solve by other 
means. 


y Example 3.B.22. (Kiran Kedlaya, USA TST) Let p > 5 and let 


poi 1 


Prove that for any integers 2, y, p? divides the numerator of fp(2) ~ foly) when 
written in lowest terrns. 


Proof. Using the tools previously introduced, this is very simple: working in 
Qp, we can write 


but these congruences have already been discussed in the solution of problem 
22, chapter 3. Co 
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Example 3.B.23. (how not to prove Fermat's last theorem) Let p be a prime 
and let k,N > 1. There exist integers 2, y,2, not all of them divisible by p 
and such that a +y% = 2% (mod p*). 

Proof. It is enough to show the existence of «,z € Zp such that ovata 2%, 
since then z {mod p*),1 and z (mod p*) is a solution. We would like to take 
z=(i+2*)/%, Using the results of the previous section, we are tempted 
to take z = Sosy (¥)2™*. Unfortunately, N is not necessarily prime to p, so 
we cannot apply directly those results. However, 


i 

N \ a NY. 
v x > Naw. 
" ((*) ) is 


and this tends to oo as n > 90 if Nup{x) > rat +,(N). We thus choose such 
x and define z by the previous series. Then z € Zp (by the previous estimate) 
and the usual argument with formal series shows that aN =140%, QO 


— nvp(N) 


Pp 


The next problem has already been discussed in chapter 3, problem 26, 
where two rather difficult sohitions were given. Using 2-adic numbers, it be- 
comes almost obvious. 


Y Example 3.B.24. Write 


2 2 Dee 
i = Dae ak nbn 

for relatively prime integers a@,,b,. Then v2(an) > m — logg(n) (this is the 

ordinary logarithm here!) for n > 2. 


Proof. Let us work in Qa. The series }>,, ae suggests considering log,(—1). 


Indeed, the series defining this is exactly ~ 7, a On the other hand, since 
logy is additive and since (--1)? 


1 and logy(1) = 0, we must have logo(-1) = 
0, that is in Q2 we have the equality 37,5; 4 = 0. But then 


2 ok fe 
v2 2 ) = U9 (- s ;) > inte ~ logg k) > n — loga(n). 


k=l kon 
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3.B.5 A geometric application 


In this section we reward the reader with a mathematical gem, due to Paul 
Monsky. This uses the existence of an absolute value on C, extending the one 
on Qp, a result which was explained in previous sections. It is a nontrivial 
fact: from field theory that Cp is isomorphic as a field with C. The choice of 
an isomorphism allows us to transfer the absolute value ou C, to one on C, 
that still extends the p-adic absolute value on Q. The reader who finds this 
construction very indirect will probably spend some time trying to construct. 
directly such an absolute value on C. Inevitable failure will probably convince 
him of the power of the arguments in previous sections. 


Theorem 3.B.25. (Monsky) One cannot dissect a square into an odd number 
of triangles of the same area. 


It is absolutely remarkable that no geometric proof is known for this pretty 
innocent-looking problem. Monsky’s proof (see {53]) is a stunning combination 
of arithmetic and combinatorics. 


Proof. Consider the square with vertices (0,0), (1,0), (0,1), (1,1). Using the 
extension of the 2-adic valuation to R, color the point (x.y) € R? in red if 
max(|s|2,/yl2) <1, in blue if jajz > max(1,\yl2) and in green if lyly > Jala 
and |ylg > 1. We will repeatedly use the easy observation that translation by 
a red point is color-preserving. 

Here is the crucial point: 


Lemma 3.B.26. IfT is a triangle whose vertices have three different colors, 
then |A(T)|2 > 1, where A(T) is the area of T. : 


Proof. By the remark on translations by red points, we may assume that one 
of the vertices of T is (0,0). Let 6 = (61,62) and ¢ = (e,¢2) be the other 
vertices, say b is blue and ¢ is green. Then 


‘ bycq ~ bee A ! CyB. 
eye = PAS] — atia-leale- 1 2-%| >, 
2 2 Pe. bila 
as |by|2, leaky 2 1 and | - B <i g 
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Consider now a dissection of the square into 7 triangles of the same area, 
which is necessarily 1/n. Color only the vertices of the triangles, as above. 
If we can prove that there is a triangle with vertices of different colors, we 
deduce from the previous lemma that |n|2 < 1 and so n is even. The existence 
of such a triangle is a trivial consequence of Sperner’s lemma, but it is perhaps 
useful to recall how things work in this easy two-dimensional case: consider 
segments on the boundary of the square whose endpoints are red and blue 
(ie. one endpoint is red and the other one blue). All vertices on the edge 
{0,1} x {0} are either red or blue. All vertices on the edge {0} x [0,1] are 
either red or green. All vertices on the edges [0,1] x {1} or {1} x [0,1] are 
either blue or green. Therefore all segments on the sides with one endpoint 
red and the other blue are on the side {0,1} x {0}. As (0,0) is red and (1,0) 
js blue, there must be an odd number of such segments. On the other hand, 
assume that no triangle has vertices of different colors. It is easy to check that 
all triangles have an even number of sides whose endpoints are red and blue. 
As the triangles partition the square, we deduce that the number of red-blue 
segments on the border of the square is even, a contradiction. Thus, there 
must be a “colorful” triangle and the theorem is proved. oO 


3.B.6 Mahler expansions 


One of the miracles of p-adic analysis is that one has a fairly explicit 
description of all continuous functions on Zp. Of course, this is far from 
being true in real or complex analysis, so the following theorem is surprising 
to say the least. It is however absolutely crucial when dealing with more 
delicate aspects of p-adic numbers and we will use it constantly in the following 
sections. 


Theorem 3.B.27. (Mahler) For any continuous function f : Zp + Qy there 
is a unique sequence (an(f))n>0 of p-adic numbers such that limy +o On = 0 


and for all z € Zp 
se) = Danis) (5). 


n>o 


Moreover, we have mitzcz, Up(f(x)) = mitre Up(an(f)). 
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Proof. Note that if the equality 


fe) = aninn(*) 


n>o 


holds for all x € Zp, then for all n 
sm) = Dat(7). 
k=O f 


Either by considering the exponential generating function of (f(n)), and 
(an(f))n or by using the theory of finite differences (see chapter 10, section 
10.3), we deduce that 


nt = Seay*(") 0, 


k=0 


Assume for a moment that we proved that limy,—oo an(f) = 0, which is 
the difficult point of the theorem. Then, since (7) € Z, fore Zp, we deduce 
that 9(z) = S59 @n(f/)() converges uniformly for x € Zy and so g is a 
continuous function. Moreover, by construction g(x) = f (n) for alln > 1, 
so by density of {1,2,...} in Z) we obtain f = g and the first part of the 
theorem follows. Finally, from the previous relations between the values of f 
at positive integers and the a,,(f) we obtain 


el F(n)) = gin vplaul A), rpl@n(f)) > min op( FO), 


so another density argument yields minzez, Up(f(#)) = mingzo tp(an(f)). 
Note that these minima exist, as vp(an(f)) diverges to oo and as f is con- 
tinuous on the compact set Zp. 

It remains to prove that vp(an(f)) -+ oo. As f is bounded (because it is 
continuous and Z, is compact), by multiplying f by some power of p we may 
assume that f(Z,) C Zp. As Zp is compact, f is uniformly continuous on Zp, 
so there is ng such that vp( f(x +p) — f(x)) > 1 for all 2 € Zp. Let 


Af(a) = fle +1) - f(z), 
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then ‘ 
n = aprkl”\ ern k 
arie)= Dwr) se+) 


and @,(f) = A” f(0). As p divides (7°) for all 1 < k < p™, it follows that 
vp( AP” f(z)) 2 1 for all « € Zp and so vp(A"’ f(x)) > 1 for all n > pre and 
alla. The map g = pape J is continuous and g(Zp)) C Zp. Applying the 
same argument to g, we find m; such that vp( AP" g(x) > 1 for all 2. Then 
up(A" f(x)) > 2 for all n > p™ +p™. Continuing like this, we find integers n; 
such that v,(A"f(z)) > d for all n > p' +++-+p"! and all # € Zp. Taking 
x = 0 shows that vp(an(f)) -> oo and finishes the proof. oO 


Remark 3.B.28. The numbers a,(f)} are called the Mahler coefficients of the 
function f. We discussed only the case of Q,-valued functions, but the result 
holds for K-valued functions, where K is any complete subfield of Cy, with 
basically the same proof. 
Here is a nice application of the previous theorem, proposed at the USA 
TST 2011 by Josh Nichols-Barrer. 
Y Example 3.B.29. Let p be a prime. We say that a sequence of integers {zn} 74 
is a p-pod if for each ¢ > 0, there is an N > 0 such that whenever m > N, p® 


divides the sum A 
m 
S t(D ae 


Prove that if both sequences {r,}S29 and {yn}%2y are p-pods, then the se- 


quence {tpYn}Sq is a p-pod. 
Proof. See the sequence zn a8 a map on N in the obvious way. The Mahler 
coefficients of this map are precisely the mumbers 


m 


@m{z) = ¥(-1)* ) Zhe 


katt 


By hypothesis, z is a p-pod if and only if a,,(z} tends to 0 in Q, and by 
Mahler’s theorem this happens if and only if z, extends to a continuous func- 
tion on Zp (namely 2 -+ Dy>9 dn(z) - (%)). But the pointwise product of two 
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continuous functions on Z, is clearly a continuous function, from which the 
result follows. a] 

One also has a characterization of locally analytic functions f : Zy > Qp 
in terms of their Mahler coefficients, though the proof is much more difficult. 
Recall that a function f : Zp > Q, is called locally analytic if for any a € Zp 
there exists n, and p-adic numbers f(a) such that whenever Up(x@—a) > Ne 
we have 


fj= TE Geen. 


Theorem 3.B.30. (Amice) A continuous function f : Zp > Qy is locally 
analytic on Zp if and only if tts Mahler coefficients a,(f) satisfy 


Him Supp ooo 4) |@n(F)lp < 1. 


3.B.7 The p-adic Gamma function: preliminaries 


Recall that the Gamma function is defined for Re(s) > 0 by 


= 
T(s) = [ ete, 
Jo t 


It has the nice property that it interpolates the numbers n!, as P'(n) = (n—1)! 
for any n. One would like to have a p-adic analogue of the Gamma function, as 
this function plays an amazingly important role in real and complex analysis. 
Unfortunately, it is not difficult to see that there is no continuous function 
f: 2, + Q, such that f(n) = {n — 1)! for all positive integers n. On the 
other hand, we have the following beautiful result, due to Morita. To simplify 
the exposition, we will assume from now on that p > 2. There are natural 
extensions of all the following results to the case when p = 2, but that would 
force us discuss two cases in both the statement and proof of the following 
results. 


“This means that there is r > Q such that vp(au(f)) 2 rn for all sufficiently large n. 
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Theorem 3.B.31. (Morita) There is a unique continuous map Tp : Zy + Qp 
such that for alln > 2 we have 


Tew 
To(n)= ("Ts 
j=l 
ged(p,j)=1 
Proof. Defining 
any =(-1y" TT 
gedlpyi)=1 
for n > 2, let us prove the following 


Lemma 3.B.32. g(n + p*) = g(n) (mod p*) for alin and all k > 1. 


We have 
nt ntpk—-1 
gr)—grn+p*)=(-1” TT g-}a+ TT 

jl j=n 

gedtp}=1 ged(jp)=1 

so it is enough to check that 
netprm1 
a+ YT] z 
gedlip)=1 


But if 7: Z~» Z/p*Z is the natural reduction map, it is clear that 


ntpe—i 


“| TL é}=I1e 
j g¢G 
ged(j.p)=1 


where G = (Z/p*Z)". The elements g in the previous product come in pairs 
(9,971), but one has to pay attention to the fact that one might have g? = 1. 
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However, as p > 2, this appears precisely when g = 1 or g = —1. Thus, the 
product of all g’s equals ~1 and we are done. 

The previous lemma easily implies that v,(g(m) -- g(n)) > Up(m — n) 
for all distinct positive integers m and n. Choose any p-adic integer a and 
any sequence zr, of positive integers such that lino @n = @ in Zp. Since 
vp(9(@i)—g(xj)) > up(a4-~a,), it follows that the sequence (g(an))n is a Cauchy 
sequence and so it converges to some p-adie integer g(a). If y_, is another 
sequence that converges to a, then applying the result we have just. obtained 
to the sequence 21,41, £2, y2,..., we deduce that g(y,) converges to g(a), ie. 
g(a) is independent of the choice of the sequence (2p )n. Thus, we obtain a map 
Tp : Zp + Zp which clearly extends g. Passing to the limit. in the inequality 
up(g(m) — g(n)) 2 vp(mm ~ n), we deduce that vp(Cp(z) — Pp(y)) = vp(a ~ y) 
for all x,y € Zp, showing that I’, is continuous. This proves the existence of 
Tp. The uniqueness part is a trivial consequence of the density of N in Z. © 


The following proposition summarizes the basic properties of Tp. We will 
prove much deeper results in later sections, once we have developed enough 
tools. 


Proposition 3.B.33. 1) For all positive integers n we have 


Tp(e+1) = (-1)1__ 2 


[z}t-ol@l 
2) Vp(Zp) © Zs. 
3) If tla) = x forz é Zy and t)(x) = —1 for x € pZ,y, then 
T(z + 1) = 7(z)P,(2). 
4) If & © Zp and r(x) € {1,2,...,p} is the unique integer such that 
x—r(x) € pZp, 


then 
P,(z)-Pp(1- 2) = (~1)"" 
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Proof. 1) This follows immediately by definition of the p-adic Gamma func- 
tion. = 

2) By construction, vp(Ip(n}) = 0 for integers n > 2. As these integers 
form a dense subset of Z, and as vp oT'p is continuous, 2) follows. 

3) This follows immediately from the definition if x is a positive integer. 
The general case follows by density and continuity. 

4) By density and continuity, it suffices to prove that 


P,(-n)Pp(n + 1) = (-1)yttlevel 
for positive integers n. But multiplying the relations 
PG = 9) = m(-DTp(-7) 


from 3) yields 


hoa I T(-9) 
=o IT i 


pli ged(p,j)=1 
= (~1)P/h (ay b(n + 1) 


and the result follows. 


3.B.8 Mahler expansions and discrete antiderivatives 


Exploiting Mahler’s theorem, we develop basic p-adic calculus in this sec- 
tion. This will then be applied to the p-adic Gamma. function and then to 
establish some fairly deep congruences. We start by proving that any contin- 
uous p-adic function has a continuous (discrete) antiderivative. 


Theorem 3.B.34. Let f :Z, -» Cy be @ continuous function, There exists a 
unique continuous function Sf : Zp ~> Cp such that Sf(0) =0 and 


Sf(@+1)— Sf(«) = f(x) 


Mahler’s coefficients of q. 
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Proof. This is very easy using Mahler’s theorem 3.B.27. Namely, look for 


Sia) = ohn () 


n>0 


and observe that 


Sfl@+1)~ Sf(e)= Sob, (, :) = Lobnas (") 


nei 


Taking into account the condition that Sf (0) = 0, we have bo = 0. Also, 
Sf satisfies the desired equation if and only if bp41 = @,(f) for all n > 0. 
Thus any solution must satisfy a,(Sf) = a,— i(f) for n > 1 and ao(Sf) 2 0, 
yielding uniqueness. But this also gives the existence, since if Gn(f) > 0, then 
@_-i(f} > 0 and so Sf(a) = once On {Z) is a continuous function if fis. 0 


Next, we define the notion of integrable and C! class functions, Unfor- 
tunately, there isn’t really a very good notion of p-adic integration and each 
such construction has its limitations. For instance, the one we have chosen in 
this book has the drawback that not all continuous functions are integrable. 
But since we will deal only with sufficiently nice functions, this will be enough 
for our purposes. 


Definition 3.B.35. 1) A function f : 2, + Cy is called Volkenborn inte- 
grable (or simply integrable) if 


: ; 1c 
I fe)de = tim oD, £0) 


j=0 


exists in Cp. 


2) Say a continuous function f : Ly ~+ Cy is of class C! if 
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These definitions might seem a bit strange at first. However, the average 
whose limit defines the integral of f should be seen as a Riemann sum. So 
Volkenborn integrable functions are precisely those for which Riemann sums 
converge. The definition of C! class functions is a bit more subtle, but one can 
prove (although this is nontrivial) that it agrees with the intuitive definition. 
We will prove part of this assertion when proving the next theorem. The 
following remark will play a very important role in the proof. 


Remark 3.B.36. If x #y € Zp, then 


= (7) ~ (‘)) < fants 


Indeed, Vandermonde’s identity (proposition 3.B.21) yields 


SOs) 20) 


ESOC) 

—n- 9G nm-~jo-d 

which immediately yields the result (using the fact that @) € Zp if x € Zy). 
Note that 


1 
Hem(1, 2,...,2)[p = pret "] > =, 


so we obtain the very useful estimate 


£ y\i 
= <nic— 
() (“) | Snlz—vlp 


The following result is fairly technical, so the reader might want to skip 
the proof at a first reading. 


Theorem 3.B.37. A function f of class C! is integrable and 


e (~1 ye 
ff » ere tales 


Moreover, in this case we also have: 
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1) f is continuously differentiable, ie. f' (c) = lim, 4, LY) 
C, for all x and x++ f'(x) is continuous. =G 


2) We have 


exists in 


sere) = f (f@+w ~ slw)du. 


“P 


Proof. The observation that 


te 4 
a » $4) = 5 5F0") 
1 pe 
=n”) 
Laas! k 
= 57 cealh) (2 ~1 
ok \ ke P 
yields the estimate 
| 4 
im Seawoad 
— fY- ax (f)) <su 
QP » » rare naa S SUP Ent 
P 
where 
_, le~i(P)lp | fp? = 1 
sua = Sele (#—2) ays 


P 

Thus, to establish the first fornmla of the theorem, it is enough to check that 

SUP; Fn,4 tends to 0 as n + oo. Clearly, ca Sk, 80 tay < klax ip 
7 ks AS )ip. 


Moreover, since (~1)*-! = (=), remark 3.B.36 yields an < lens (f) |p S- 
Putting everything together, we deduce that i" 


Tak S Klak-a(f)|p «min (. Ba ) 


p 


Combined with kla,(f)|p > 0, this easily proves that SUP; Znz — 0, estab- 
lishing the first formula. ; 


172 Chapter 3. Look at the Exponent 


1) By remark 3.8.36, there are polynomials P, € Q{X,Y] such that 


@)-@ = P,(u, 2) 
U-xr 


for all w, 2 and |P,(u,2)|p <n. Since nlan(f){p -> 9, it follows that the series 


$0) =F) ap) Palen) 


Uae 


converges uniformly and its limit as u -> x is 7,59 @n(f)Palw. x), which is 
continuous (as the series converges uniformly). : 

2) First, we compute the Mahler coefficients of ge(u) = f(a +u) — f(u) 
by using Vandermonde’s identity 3.B.21: 


neo n>o 
S an(f) 2 fat u 
et pa) 6 | ea 
nlf d i} \n-i 
rel t=] 


= » ange) (;) ’ 


where 


anlar) = Dont(,,” 4): 


n>k 
Thus klag(ge}lp S SUPpsn MAn(f)lp, 80 ge is of class C! and we can express 


Gla) = ff (Fe+u) - Few)ydu 


in terms of Mahler coefficients of g,, using the first formula of the theorem: 


ow fo -1)" iz 
G@)= 0 b +1 omen) (7): 


m=1 Ln=0 
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It is apparent from this formula that G is continuous. F: inally, if 2 > 1 is an 
integer, we can write by definition 


pri 


Gla) = Jia Yo fle +9) = FU) 
j=0 
red 

= Jim LUC" +3) -F0) 
j=0 


The conclusion follows now by continuity of G and S$ 'f’ together with the 
density of the set of positive integers in Zp. o 


3.B.9 Application to the p-adic I-function 


We are now ready to prove the following deep theorem, that will allow us 
to prove very difficult congruences: 


Theorem 3.B.38. log, oP, is an analytic function on PLy and has @ power- 
series expansion. 


l wz) = Apa ~ fa a a 
Dey Tpke) Ame > an(2n + 1” ; 


where 
ke 
pet 
1 
z 


ee = = ae ah 
r= ff log, (u)du, m= ff x de> ie DS ue 


desl 
gedlip)=t 


Proof. In this proof we will use the sirnpler notation {x} for ja|p. Using propo- 
sition 3.B.33, we deduce that if f(a) == log, (T(z), then 


F(t +1) = f(z) = 1yq)24 log, (2). 


| 
| 
i 
fr 
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This functional equation combined with the fact that (0) = 0 (this follows 
from proposition 3.B.33) shows that f = SA, where A(a) = = He|x1 log,(x). 
But it is easy to find an antiderivative of A: reasoning from the original power 
series, one chooses B(x) = 1),).1(x - log,(x) — x) and checks that B’ = A. 
Note that thanks to the hypothesis « € p&y, we have |u|, = 1 if and only if 
lu+ lp = 1, for all u € Zp. Therefore, theorems 3.B.30 and 3.B.37 yield the 
integral representation of f: 


log, (I p(x)) = | Ljujaa ((@ + u) log, (a + u) — (@ + u) ~ wlog, u + u)du. 


p 


Using the fact that log, (a + u) = log, u+ jog, (1 + 2/u) and expanding 


(-1)""? an 
log,(1 + 2/u) Pee a 
for |u} = 1 yields 
(x + u)log,(z + u) ~ x —ulog,u 
(-)r gm 
= (x +u)logyu + (e+u)- $7 — =~ B~ log, u 


n>1 


_pjr~l ntl nk n 
= rlog,u-z4 x! ‘ = | > yee ee Z 


ye i 
nO” n>2 


and an easy calculation (based on the change of variable n — 1 = j in the last 
sum) yields 
Lujaa (e+) log, (2+ u) ~ (+4) — Bee a = lyn log, ut > fae(u) u), 
not 
where 
Fralu) =, Som Ly te 


Of course, we integrate this, but the difficult point is to check that the integral 
commutes with the infinite sum. Again using theorems 3.B.30 and 3.B.37, we 
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can write 


ae oa axl fr, 2) 
REO n>1 
-1)* — 
= (D1) 


all manipulations being easily justified by the fact that Frye takes small values 
{and so its Mahler coefficients are small). We finally deduce that 


log, T'p(a) = w+ Ag + Day fagx(ujdu. 
n=l 
It remains to simplify this a little bit thanks to the following 
Lemma 3.B.39. If g is of class C! and odd, then 


rd 
i g{uldu = £0) 
Zp 2 


Proof. We leave this as an easy exercise to the reader, We give a hint, however: 
write 


oe" — 9) = —gG — p") = —(g(3) — p"g'(3) + ofp")) 
and observe that pars 9G) = (S9')(p*). a 


Applying this to the functions faye with n odd, we obtain that 


[ faz(ujdu = 0 
2p 


for n odd. Putting everything together, the result follows. 
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We would like to have more information about the numbers 4, that appear 
in the previous theorem. A first step in doing this is the following lemma: 


Lemma 3.B.40. We have 
An ~ (A- poy f a" dx € Zp. 
Zp 


Proof. Using the fact that g - g™! is a permutation of the finite group 
(Z/p*Z)*, we deduce that 


pet pL pe-1 prota} i 
G72" Qh z en —p™ > yan (mod D Zp): 
i=l ih i=0 =O 

gcd(i,p)=1 ged(ip)=1 


Dividing by p* and letting k + oo, the result follows. 


Ba= 7 ada. 
Ep 


By linearity of the Volkenborn integral we can write for n > 0 


EC LECH )« 


k=0 


Next, let 


=f (etiy—2 Yar 
Zp 


which immediately yields the exponential generating function of the sequence 
(Bada: 
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This allows us to compute Bo = 1,B, = —}, By = } and to deduce that all 
By are rational numbers. Actually, these numbers B, are called Bernoulli 
numbers and appear all over the place in mathematics. Their rather subtle 
arithmetic properties will help us prove the following important 


Theorem 3.B.41. One has py, € Zp for alln > 1. Moreover, if p > 3, then 
AVE Zp. 


Proof. Using the previous lemma and discussion, it suffices to prove that 
pBr € Zy for all n (the second part follows from the observation that By = 2 


is prime to p if p > 3, so we will not say more about it). We will prove this 
by induction on n. For n = 1, it is clear, so assume that it holds for j <n. 
Recall that 


Let k be a positive integer. Multiplying this by e** yields the equality 
q] 


ebX XK XP SS fn’ iad 
a7 (fae. 


AX 
4 = x 7 +X(1 +e% pe. tel DX), 


Identifying the coefficients of X"* in this equality shows that for all n > 1 


" n 1 Gfati nflmi 
L428 bo (b= 1)" = PY Botts, 
i=0 


+1 i 
Take k = p and note that a CEB = (PB) () BS € Zy for lO <i < 


n~— 1, Combining this observation and the previous equality, we deduce that 
PBn € Zp, finishing the proof of the theorem. 
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3.B.10 Some deep congruences 


p-adic mumbers play a crucial role in establishing difficult congruences, 
which are almost impossible to get by other means. The following theorem 
is a very famous such example. We present here Robert’s and Zuber’s nice 
proof, following [68]. 


Theorem 3.B.42. (Kazandzidis) If p > 3, then for all positive integers n,k 


we have 
(2) =() ome 


where j = 3 + vp(nk(n ~ k)) + up((Z)). 


Proof. Proposition 3.B.33 shows that for all positive integers z we have 
Tp(pr) = (— 


Therefore, if we denote « = kp and y = (n — k)p, we have 
Ge) _ Tole ty) ; 
(i) Pe(e)P py) 


The definition of log, (x) as a power series shows that |x ~ 1], = | log,(x)jp for 
all a € 1+ pZ,. So, if we let f(x) = log,(Tp(x)), the congruence we have to 
prove is equivalent to 


up f(a +y) — fle) — f(y) 2 vp(ey(x + y)). 
But theorem 3.B.38 yields 


Up f(a + 9) ~ F(x) ~ Fm) 


An Qntl — 2nt1 _ nth 
>(Cadtnt ype — aed yon 


ae 
2 jaf Vp(An); 


Ee 
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where n 
— An ey )2ntl 2nd Qn] 
Gn In(dn +1 +y) x yn"). 


It is thus enough to prove that vp(@n) > vp(xy(x+y)) for alln > 1. Forn = 1, 
this follows from A; € Z», which was proved in theorem 3.B.41. So, suppose 
that n > 2. Since (X + 1)?"+1 — X?n+1 _ 1 vanishes at 0 and —1, there is 
f € ZX] of degree 2n ~ 2 such that 


Since y*""* f(x/y) is a homogeneous polynomial of degree 2n — 2 with integer 
coefficients in 2,y € pZp, we deduce that 


vpl(a-ty)rtt — ght — 241) = a (oy(@ + y)) + 0p (vrs ()) 
= up(ry(a + y)) + 2n 2. 
It is thus enough to prove that 
Up(An) + 2n — 2 > vp(2n(2n + 1)), 
which follows easily from vp(A,) > ~1 (theorem 3.B.41) and the inequalities 
Up(2n(2n + 1)) < max(vp(2n), vp(2n + 1)) < logs(Qn + 1) < Qn ~ 3, 
the Jast one being true for n > 2. Oo 


Similar techniques apply to prove the following nice congruence, taken 
from [17]. 


Theorem 3.B.43. Ifp > 5 and n> 1, then 


= ((p— 1)" (mod p3t%el""-”)) 


and this does not hold modulo piter(nt~n) unless p is a Wolstentholme prime, 
te. (RY) = 1 (mod pt). 
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There are similar sharp congruences for p = 2,3, 5, for which we refer the 
reader to [17]. 


Proof. (sketch) As in the proof of the previous theorem, one starts by noting 
that, 


Talnp) = (yr PE, Tylp) =~ 1), 


so the congruence reduces to 
Up (Cp(np) —Tp(p)") > 3 + vp(n — n). 


If f(z) = log, (T'p(x)), the equality up(y— 1) = up(log, y) (true for y € 1+pZ,p) 
reduces the problem to showing that 


vp f(np) ~ nf (p)) 2 3+ vp(n? ~ x). 


One uses again theorem 3.B.38, but one also has to analyze the term containing 
dz. As the details are routine, they are left to the reader. ima) 


Here is yet another similar congruence, with the same idea. 


Theorem 3.B.44. (F.Rodrigo- Villegas) Let (an)n be a sequence of integers, 
finitely many of which are nonzero and such that Dn>t Nan = 0. Define 


A(n) = [[Gen)!*. 


k>1 


Then for any prime p > 3 and any 3 > 1 we have 


Proof. (sketch) The point is to study log, (#2) by expressing it in terms 

of values of the function x +> log, I'p(x). For instance, if s = 1, then this equals 
pp 

So, cx log, Tp(kp). One finishes as usual by using the analytic expansion of 


f(z). a 
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Theorem 3.B.45. Let n be a multiple of an odd prime p. Then we have an 
equality in Qp: 


nod 
1, Aj, 95 
2, E77 Ly 
ged(k.p)=1 
where \; are as in theorem 3.B.38. 
Proof. (sketch) Note that f(r) = log,(Pp()) is locally analytic on Zp, as 
follows from theorem 3.B.38 and from the functional equation 
f(e4+1)-f(z)= Ljaj,=1 log, x. 
It follows easily that f is differentiable and that g = f’ satisfies 


1 
g(@ +1) — g(x) = Vatp=t 2 
Thus 


rw rT 
Zz an) -9(0- 
k=l ged(k,p)=1 


Next, by differentiating the Taylor expansion of f in theorem 3.B.38, we obtain 
that for # € pZ»p 


Delas 
g(a) = do ~ S753”. 
jai“? 
The result follows by combining these two identities (with @ = n in the second 
one) and by observing that g(1) = g(0) = Ao. 


3.B.11 More on Bernoulli numbers 


In this section we focus on some classical congruences satisfied by 
Bernoulli numbers. Recall from section 3.B.9 that they are defined by 
B, = te, x"dz. We saw that (Bn), is a sequence of rational numbers, with 
exponential generating function 
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Lemma. 3.B.39 shows that B, = 0 for all odd numbers n > 1 (this can also 
be deduced from the generating function, by checking that the function 2 —> 
oy + ¢ is even). We saw during the proof of theorem 3.B.41 that Bernoulli 
numbers satisfy the crucial identity 


Sq(k) = 1842" $e +R 1)" 


od nth ones 
- Td i ) Bik , 


i=O 


This identity will play a crucial role in the proof of the following beautiful 
result, which considerably refines theorem 3.B.41. 


Theorem 3.B.46. (von Staudt-Clausen theorem) For alin > 1 we have 
Bat D5 
pollen? 
Proof. Let p be any prime, We need the following elernentary result: 
Lemma 3.B.47. Ifn is even, then for allk > 1 we have Sp(p**!) = p-Sp(p*) 
(mod p**), 


Proof. This is a simple application of the binomial formula: 


pRomL pool 


= 00" + apt) 


j=0 [20 
= pSn(p*) + 5 (p— Lp Sy (p*) 
= pSn(p*) (mod htt), 
Note that if p is odd, then the hypothesis that n is even is useless. a 
Next, we claim that Sele ~— Bay, € Zp for all primes p and all n, Note 


3 on (p 
that the previous lemma ee that cone = Sages € Zy for all k > 1, 
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hence it is enough to show that we can find k such that Seip ~ Bon € Ly. 
But 
“1 
Son(p*) 1 Sl font ane 
pad ie aot BypkOn-7) 
py B= oi i 3p 


is certainly in Z, for k large enough, as each of the terms in the sum is in Zp. 
This proves the claim. 

Finally, a standard argument’ shows that S;,(p) = 0 (mod p) unless p~ 1 
divides n, in which case S,(p) = ~1 (mod p). Combining this with the result 
of the previous paragraph, we deduce that Bg, @ Zp» unless p— 1/2n, in which 
case Bon + ? € Zy. All in all, this shows that the rational number Bo, + 
Dyan 4 belongs to Z» for all primes p and so it is an integer. The result 
follows. a 


The following classical result is cousiderably more difficult to prove. The 
proof is actually rather mysterious, though elementary, It is highly related to 
the existence of p-adic analogues of the Riemann zeta function, though this 
may not be apparent at all... 


Theorem 3.B.48, (Kummer’s congruences) Let m,n be positive integers and 
let p be a prime such that p~1 does not divide n, but p~1 divides m~n. 
Then 8m — Bn € py. 


Proof. It suffices to prove that 3 = feo} (mod p) whenever p— 1 does not 
divide k > 1. Note that it is not even clear that Be € Zp, but this will be the 
case (of course, this is clear if k is odd, as then Bo = 1). Let us fix an integer 
isa<p-1 such that a is a primitive root modulo p. The following lemma 


crucial. 


‘Let g be a primitive root mod p. If p-~ 1 does not divide n, then g" 41 and 


Ree gly 
Sn{p) = ya oa g = = 
20 g 
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Lemma 3.B.49. There exist by € Zp such that (as formal series) 


a 1 = x n 
BELT ETT = le - 


n>o 


Proof. By changing the variable Y = e* — 1, this is equivalent to 
? y= hy 
oe es +) Y™. 
(+Yje-1 Y ou 


The binomial formula and the fact that gced(a,p) = 1 yield the existence of 
9 € YZ,[Y] such that (1+ Y¥)* — 1 =a¥(i+g(¥)). We then have 


1 1 1 n* n 
dat d Cer) - 1)= Dye" 


n>i 


The result follows, as g € YZ,[Y]. 


Let use denote U;, = (a” —1)- Ba and observe that by definition of (Bn)n 


we have 
= Titi aah 


n2o 


To fully exploit lemma 3.B.49, let us denote 
mt 
i{m\. 
amp = RMX — yh = o-ays() 
j=0 


so that 


Cra = Sob; * 83n 


5Note that. ‘there are no convergence issues, aS sn, = 0 for n > k. 


3.B. An Introduction to p-adic Numbers 185 


for all n > 0. 

It is ait easy to conclude. Recall that p ~ 1 does not divide k, so p does 
not divide a* — 1. The previous relation and the fact that bj,sjn € Zp, for 
all j,n show that Uz € Zp and so 4 € Z,. The same sreainel shows that 
Peet € Zy. Also, Fermat’s little ets yields sjn = 8jn4p-1 (mod p) for 
all j = O and all n > 1. Hence U, = Unip-1 (mod p) for alln > 1. As 

aktP-T = gk (mod p), this congruence is equivalent to 


Bey p- 
(a — f= (ah — 1) PT (mod p). 


The ceed result follows from this congruence and the fact that p does not 
divide a* ~ 1. aq 


Remark 3.B.50. Here is the real meaning of this proof. As we have already 
said, the crucial fact is that f.(X) = aexeci — & © ZpllX]], say fo(X) = 
ease 5,X”. This allows us to define a measure jig on Zp (i.e. continuous 
linear form on the space of continuous functions g : Zy ~> Qp) by 


; Gha = > On (G)On 
Zp n>0 


for all continuous functions g : Zp + Qp. Here an(g) is the nth Mahler aia 
cient of g and the series converges by Mahler's theorem (as limp oo @n(g) = 
and 6, € Zp). Since by, € Zp for all n, Mabler’s theorem shows that 


vp ( hi an) > min wp(o(2)) 


for all continuous maps g. 
Note that s,;, = a,(2*), so the equality Unya = Ljzo 5355. established 
during the proof can be written as 


On+1 -{ 2" ia. 
@, 


‘p 
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Since vp(x” — 2?+P-!) > 1 for all z € Z, and n > 1, the previous paragraph 
shows that U;, = Uyip,—1 (mod p) for alln > L. 

This new interpretation of the proof of Kummer’s congruence yields other 
nontrivial congruences. Assume for instance that m =n (mod p’ (p—1)) and 
m,n > N, By Euler’s theorem we have v,(2™ — 2") > N +1 for all x € Zp 
and we deduce that Uj, = Un (mod p’+?), This observation is the beginning 
of the construction of the p-adic zeta function of Kubota and Leopoldt. 

We end this addendum with an application of the previous results to 
harmonic numbers, It is standard that for any prime p > 3 we have 


1 
Ss E= 0 (mod p”) and p= @ (mod p). 


The following result considerably refines these congruences. 


Theorem 3.B.51. For all primes p > 3 we have 


pol pol 2 

1 2p ,. 1 
5 p= FB pa (mod p*) and } ‘= = Bp-s (mod p*). 
k=1 k=l 


Proof. Fuler’s theorem yields 


We will use the following general result: 
Lemma 3.B.52. Ifp > 3 andn > 1, then 


2 
14.2" 4...4+(p—1)" = pBy + “F 


Proof. The left-hand side is equal to 


1 atl fy +1 np’ B, =] met n pF 
( k ) Base! = pBy, oF arta + > k-1 Batik 
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It is thus enough to prove that (4°31) Bovine € ph, for all 8<k<n+1. 

As pBn41-k € Zp (by the von Staudt-Clausen theorem) and (,",) € Zp, it is 
2 = 3 s aes 

enough to check that &— € Z,. This is easy and left to the reader (here one 


crucially uses the hypothesis p > 3). O 


Applying the previous lemma to n = y(p”) —2 and noting that: By; = 0 
{as n is even), we obtain 


pot 


2) 9 
KP)? Se PByip2)-2 (mod p*). 
kel 


It remains to use Kummer’s congruence to obtain Bo(p2)-2 = 2B 3 (mod p) 
and finally 


pe = Bp-3 (mod p?). 
kot 
One could apply a similar method for the second congruence, but we 
prefer to deduce it from the first one. Note that 


ply pol 1 1 pol 1 
2) i= =4 =p S- ~ 
i Pies) dpa 


Using this identity and the congruence we have just proved, it is enough to 
prove that 


prt 
ye (ae + Z| = (mod p*) 


os) 


{mod p) 


The last congruence is also standard and it is proved using primitive roots 
mod p. i 


Chapter 4 


Primes and Squares 


This chapter is concerned with arithmetic properties of primes of the form 
4k +1. It is very elementary and most problems use the following two results. 


Theorem 4.1. (Fermat) Any prime number of the form 4k-+1 can be written 
as the sum of squares of two integers. 


There are many proofs of this classical result, see for instance the first 
example in chapter 4 of [3]. For a different proof, using properties of Gauss 
and Jacobi sums, see the addendum 9.A. 


Proposition 4,2, Let p be a prime of the form 4k +3 and let a and b be 
integers such that p divides a” + 6?. Then p divides a and 6. 


Proof. If p does not divide a, there exists c such that ca = 1 (mod p). Then 
(bc)? = ~1 (mod p), thus 


(-1)°F" = (bey =1 (mod p), 


the last congruence being Fermat’s little theorem. But this is clearly impossi- 
a E 
ble, since by hypothesis (-1)°2" == —1. The result follows. oO 


An easy consequence of proposition 4.2 is that up(a? + 62) is even for any 
prime p of the form 4k + 3 and for any integers a,b. This is a very useful tool 
when studying some diophantine equations, as the following problems show. 
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1. Prove that the number 4mn — m --n cannot be a perfect square if m 
and n are positive integers. 


Fermat 


Proof. If 4mn ~ m ~n = x? for some integer 2, then 
(4m — 1)(4n ~ 1) = (Q2)* +1. 


But there exists a prime p = 3 (mod 4) such that p|4m ~ 1. Then p divides 
(2x)? +1, contradicting proposition 4.2. go 


2. Prove that the equation y? = 2° — 4 has no integer solutions. 
Balkan Olympiad 1998 


Proof. Write the equation as 2° +25 = y?+6?. We claim that there is always 
a prime p = 3 (mod 4) such that up(x° + 25) is odd. Since for any such prime 
p we have v,(y*+6?) = 0 (mod 2), we will thus get a contradiction. Note that 
z is odd, otherwise x = 2x1, y = 2y; and 8 divides y? +1, which is impossible. 
If « = 1 (mod 4), there is a prime p = 3 (mod 4) such that vp(c +2) = 1 
(mod 2). One cannot have p|z* ~ 223 +42? ~ 82-416 (otherwise p would divide 
5-24), so up(a + 25) = vp(a + 2) = 1 (mod 2) and we are done. If « = -1 
(mod 4), then z4+--223+.,,.+16 == 3 (mod 4) and we can repeat the argument 
by taking a prime p = 3 (mod 4) such that up(c*~2a34---+16) = 1 (mod 2). 
The claim being proved, the result follows. QO 


Proof. We can work modulo 11: since z!! = x (mod 11) for any x, we deduce 


that 2° = 0,1,~1 (mod 11) for any x. On the other hand, the quadratic 
residues mod 1} are trivial to find: those are 0,1, 4,9, 5,3. One readily checks 
that the equation has no solution mod 11 using these observations. So, it does 
not have integral solutions either. Oo 


3. Solve in integers the equation a? == y? + 7. 
Titu Andreescu, USA TST 2008 
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Proof. We clearly have no solutions for y < —1, so let us suppose that y+2 > 0. 
Taking the equation modulo 4, we easily obtain that y = 1 (mod 4), The key 
point is to rewrite the equation as 27 + 11? = y’ +27 or, by factoring the 
right-hand side, as 


w+ 11? = (y+ 2)(y° ~ 29° + 4y4 — By) + 16y? — 32y + 64). 


Since y = 1 (mod 4), we have y + 2 = 3 (mod 4), thus there exists a prime ¢ 
such that v,¢(y + 2) is odd. Note that q does not divide 


yf — 2y? + 4y! — 8y9 + 16y? — S2y + 64, 


as otherwise q would divide 7-64 and x? +-11?, a contradiction. Thus vg(y" +27) 
is odd, which is impossible, as it equals v,(z? + 117) and q = 3 (mod 4). The 
result follows. oO 


4. Find all pairs (m,n) of positive integers such that. 


m? .~ 1|3 + (n! — 2)", 


Gabriel Dospinescu 


Proof. First, assume that n > 2. Then m cannot be odd, since otherwise 
8 would divide m? — 1, but 37+ (n! — 2)” is odd. So m is even. But then 
m?~1 == ~1 (mod 4), so there exists a prime p = —1 (mod 4) dividing m?—1. 
But then p divides 3” + (n! ~ 2) and since m is even, this implies that p 
divides 3 and n! ~ 2. Thus 3 divides n! — 2, a contradiction. 

Thus, we must have n = 1 or n = 2. If n = 1 then either m? —1[3" +1 
and m is even or m? — 1/3” — 1 and m is odd. In the first case we can use 
the same argument as before to get a contradiction (choose a prime p = —1 
(mod 4) dividing m* — 1), while the second case is impossible since 3” — 1 is 
not a multiple of 8 when m is odd. Thus n = 2 and m? — 1/3”, Thus there 
is k < m such that (m—1)(m +1) = 3*, But then m~ 1,m+1 are powers 
of 3 which differ by 2. Thus clearly m—1 = 1 and m = 2. We deduce that 
m= n= 2 is the only solution of the problem. 
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ety 


5. Find all pairs (a, y) of positive integers such that the number - is 


a divisor of 1995. 
Bulgaria 1995 


Proof. Note that 1995 = 3-5-7-19. The key observation is that 3,7, 19 are all 
primes of the form 4k +3. If p is any of these primes and if p divides oy 
then p divides =? + y? and so it divides x,y. Writing x = pri,y = pyi, we 


obtain 
Pay attat 


E~y aH 


Doing this for every prime factor p of Se and noting that 2? + y? = ay 
has no solutions with z,y > 1, we just have to solve the equation se = 5. 
This is easy, since we can write it as (22 — 5)? + (2y +5)? = 50 and since 
50 can only be written as sum of two squares in two ways 1 +49 = 25 + 25. 
Thus 2y+5 € {—7,~5,-~1,1,5, 7} and since y is positive we must have y = 1. 
But then z = 2 or x = 3. Putting everything together, we deduce that the 
solutions are all pairs (2k, k), (3k, k) with & € {1,3, 7,19, 21,57, 133,399}. O 


6. Find all n-tuples (a1, @2,..., a) of positive integers such that 
(ai! — 1)(@2! — 1)... (@n! — 1) — 16 
is a perfect square. 
Gabriel Dospinescu 
Proof. Suppose that 
last ~1)(aq!—1)...{an!— I) - 16 = FB. 


First, we claim that a; € {2,3} for all 4. It is clear that a; 4 1 for all 7, so 
assume that a; > 3 for some «. Then a;!— 1 = 3 (mod 4), thus there is a 
prime p = 3 (mod 4) such that p divides a;! — 1. Then p divides k? + 47, a 
contradiction. Say m among the numbers 4; are equal to 3 and the remaining 
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ones are equal to 2. The equation becomes 5” — 16 = k*. As k is odd, a 
consideration mod 8 shows that m is even. But then (5"/?—k)(5™/24-k) = 16, 
which easily implies that m = 2, Thus the sequence (aj, a2,...,@,) consists 
of two numbers equal to 3 and the remaining equal to 2. Clearly, all such 
sequences are solutions of the problem. oO 


7. Prove that there are infinitely many pairs of consecutive numbers, no 
two of which have any prime factor of the form 4k + 3. 


Proof. Since no number of the form n? +1 has a prime factor of the form 
4k + 3, it is clear that the pairs ((n? + 1)®, (n? + 1)? +1) yield solutions of the 
problem. Oo 


The following problem is trickier and its solution uses the theory of Pell 
equations. 


8. Let p be an odd prime. Prove that p 
are integers z,y such that 2? — py" = 


1 (mod 4) if and only if there 


Proof. One direction follows directly from proposition 4.2, so let us assume 
that p = 1 (mod 4). The key point is to consider the positive Pelt equation 
x* ~ py? = 1. By the general theory of Pell equations, this has a smallest 
nontrivial solution (zo, yo) (so yo > 0 is minimal among all solutions with 
y # 0). Note that xo is odd, since otherwise 4 would divide uti. if 
Zo = 2a +1, we deduce that yo is even and a? +a = pb? for b = e If 
p divides a, then a = pe? and a+ 1 = d? for some integers c,d (because a 
and a +1 are relatively prime), so that d? ~ pc? = 1. But obviously ¢ < yo, 
contradicting the minimality of (rp, yo). Thus p divides a+1 and we can write 
a= c*,a+1= pd? for some integers c,d. But then c? — pd? = —1 and we are 
done. oO 


We continue with another beauty, which became classical in mathematical 
contests. It has the nice feature of having a completely elementary solution 
that uses quite a lot of different ideas. 
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9. Find all positive integers n such that the number 2” — 1 has a multiple 
of the form m? +9. 


IMO 1999 Shortlist 


Proof. Assume that 2” — 1 divides m? +9 for some integer m and that n > 2. 
Then the only prime factor of 2" — 1 which is 3 modulo 4 is 3. Indeed, if p 
is such a prime, then p divides m? + 3? and so p divides 3. Now, if n has 
a nontrivial odd divisor d, then 247— 1 = -1 (mod 4) and 3 does not divide 
2¢— 1. Thus 2¢ — 1 has a prime factor p = 3 (mod 4) different from 3. Since 
2¢ — 1 divides 2" — 1, Choosing m = 3a, it is enough to find a@ such that 
(2? + 1)(24 +1)...(2*" +1) divides a? + 1. this is impossible, by the previous 
remark. Thus, n must be a power of 2. Conversely, assume that n = 2* and 
observe that 
2 —1=3- (2 1/24 $1) (2 $4). 


Choosing m =: 3a, it is enough to find a such that (27+1)(2*+1)--- ("" 4.1) 
divides a?+1. The crucial point is that the Fermat numbers 2?" +1 are pairwise 
relatively prime, so by the Chinese Remainder Theorem it is sufficient to prove 
that for any i there is a such that a? + 1 = 0 (mod 2” +1). But this is clear, 
since a = 2”"* works. Thus, the answer to the problem is: all powers of 2. O 


The next problems are concerned with properties of sums of two squares. 
A crucial fact is that the set of numbers which can be written as a sum of two 
squares of integers is closed under multiplication, by Lagrange’s formula 


(a? +b?) (c? +d?) = (ad + be)? + (ae ~ bd)”. 


Actually, combining theorem 4.1 and proposition 4.2, it is easy to completely 
characterize the elements of this set: they are precisely the nonnegative inte- 
gers n such that v,(m) is even for all primes p = 3 (mod 4). 


10. Prove that a positive integer can be written as the sum of two perfect 
squares if and only if it can be written as the sum of the squares of two 
tational numbers. 


Fermat 
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Proof. One direction being clear, let us prove that if n is the sum of the squares 
of two rational numbers, then it is also the sum of the squares of two integers. 
Thus, we know that na? = b? +c? for some integers a, b,c with @ 0. For any 
prime p, we deduce that 


Up(m2) + 2up(a) = up(b? +c), 


so that p(n) has the same parity as vp(b? + ¢*). Since for any prime p = 3 
(mod 4) we have vp(6? + ¢?) = 0 (mod 2), it follows that for any such prime 
p we have v,(n) = 0 (mod 2). The result now follows from the preliminary 
discussion. Oo 


Remark 4.3, Actually, the following result holds: if n > 2 is an integer and if 
an integer x can be written as a sum of n squares of rational numbers, then it 
can also be written as a sum of n squares of integers. For n = 3, this follows 
from Davenport-Cassels’ lemma ([3], chapter 13, example 12), while for n > 4, 
this follows from the famous theorem of Lagrange, according to which any 
nonnegative integer is the sum of four squares ({3], chapter 13, example 5). 


11. Prove that each prime p of the form 4k +1 can be represented in exactly 
one way as the sum of the squares of two integers, up to the order and 
signs of the terms. 


Fermat 


Proof. The fact that any such prime p is a sum of two squares is the content 
of theorem 4.1. Let us focus on the uniqueness part. Assume that we have 
p=0+yP att. Then (e~2)(x+z) = (t—y)(t+y). We will need the 
following very useful 


Lemma 4.4. If a,b,c,d are nonzero integers such that ab = ed, then there 
are integers m,n. p,q such that a = mn,b = pg,c = mp,d = ng. 


Proof. Let ¢ = ¢ a4 4 be the representation of the fraction £ in lowest terms. 
Since ap = nc, we have pic, so we can write ¢ = mp for some m. Then also 
a=mmn. Doing the same with dp = nb yields the conclusion, 
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Coming back to the problem, assume that |x| 4 |z| and |¢] # jy! Then 
by the lemma we can find nonzero integers m1, 71, p1,q1 such that 
g-z=mm, c+z=mn, t-y=mpi, t+y= man. 


But then 


myny +P a ag = AP 


C= 2 7 4 3 ’ 


so that using Lagrange’s identity we obtain 


Ap = A(w + y?) = (m3 + 4? )(n? + pi). 


We may assume that p divides m? + q?, so that n} + pj is equal to 1,2 or 4. 
As m,.ni,p1,q) are nonzero, we obtain a contradiction unless ny = py = £1. 
But in this case we get x — 2 = +(¢—y) and «+2= 4(¢+y), thus 


{le], [21 = {lel tv} 
and we are done again. Oo 


We continue with an easy exercise in Lagrange’s formula. 


12. Prove that the equation 3* = m? +n? +1 has infinitely many solutions 
in positive integers. 
Saint-Petersburg Olympiad 


Proof. Guided by the formula 
3% — 1 = (2? + 2)(3? +1) +. (87 +1), 


we will choose k = 2%. Since all factors in the product are sums of two 
squares and since the set of numbers which are sums of two squares is stable 
by multiplication, it follows that 3°" — 1 is always a sum of two squares. Since 
it is trivially not a perfect square (it is of the form 3k + 2), the conclusion 
follows. 
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Proof. We will show that 32° = m? +n? +1 has infinitely many solutions. 
Indeed, start with the observation that 371 = 2? + 2? 1. On the other hand, 
if (k,m,n) is a solution with m > n, then (2k,3*m ~ n,3®n + m) is also a 
solution. Indeed, this follows from 


(3*m—n)? + (3'n+m)? 41 = (878 4-1)(m? 4+ n2) 41 = 9% 14153". 9 


In the following two problems we will use the fact that the density of the 
set of positive integers all of whose prime factors are of the form 4k + 1 (or 
4k + 8) is zero. This is a nontrivial result, for a proof of which we refer to [3], 
chapter 4, example 10. 


13. It is a long standing conjecture of Erdés that the equation 


has solutions in positive integers for al! positive integers n. Prove that 
the set of those n for which this statement is true has density 1. 


Proof. We look for solutions with y = z and a = na for some positive integer 
a, The equation becomes day = n(y + 2x) or, equivalently, y(4a — 1) = 2na. 
Thus, if we can find a prime factor p of n of the form p = 4a — 1, then we can 
take y = a and we have a solution in positive integers. Thus, it is enough 
to prove that the set of integers having at least one prime factor of the form 
4k — 1 has density 1, which has already been discussed. oO 


14, Let T be the set of positive integers n for which the equation n? = a? +8? 
has solutions in positive integers. Prove that T has density 1. 


Moshe Laub, AMM 6583 


Proof. We will prove that n € T if and only if n has at least one prime factor 
of the form 4k +1. Suppose first that n € T and choose positive integers a,b 
such that n? = a? +b, If all prime factors p of n are of the form 4k ~ 1, then 
for any such p we have pla? +b, so that p divides a and p divides b. Dividing 
the previous relation by p” and repeating the argument, we deduce that p*™) 
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divides a and 6 Since this happens for all pin, it follows that n divides a 
and 4, which is clearly impossible. Conversely, if n has a prime factor p = 1 
(mod 4), by Fermat’s theorem we can find integers c,d such that p =? + d?. 
We may assume that c,d are positive (they are nonzero since primes are not 
perfect squares). But then 


re (Gee ay ‘ (==) 


and so n € T. Now, the density of those numbers which are not divisible by 
any prime of the form 4k + 1 is 0 and we are done. 


We continue with two very nice problems concerning primes of the form 
4k +1. The method used in the solution of the following problem is standard. 


15. Let p be a prime number of the form 4k + 1. Prove that 


> [val = 


Proof: Write p = 4k +1 and note that 


k k 2k 
Dilviel =o =) Yo 


ele Sip i=l] koje 


since the inequality i < //7p IP with 1 < 7 < & implies that 7 < 2k. On the other 


hand, the condition 7 > © is equivalent to 7 > 1+ [¢ am since £ is not an 


integer. Thus we can also lanits 
2 3 2k f 42 
pvia= 5 (+-[F]) -#- 6] 

=1 i=1 


and it remains to prove that 
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Note that 


2 2 
#? (mod p)=?~p (=| 
P 


and since 

ye _ Ph(2k +1) 

i=1 3 ; 
it remains to prove that the sum of the quadratic residues mod p is pk. But if 
iy @2,-..,Ep51 are the nonzero quadratic residues mod p (any residue mod p 
is implicitly taken between 0 and p~ 1), then p — 21,p—29,...,p—~ xox, are a 


permutation of the a;’s (since —1 is a quadratic residue mod p, which follows 
from p = 1 (mod 4)). Thus 


2k 


Ya = e- zi) 


t=1 


and the result follows. Oo 
The following functional equation is rather nonstandard. 
16. Find all functions f : Z* — Z with the properties: 
f(a) > f(b) whenever a divides b. 
2. for all positive integers a and b, 
f(ab) + f(a? +B?) = fla) + f(b). 
Gabriel Dospinescu, Mathlinks Contest 


Proof. Since 1 divides any integer, it follows that f(1) > f(x) for all «. Let 
k= f(1). 

The first step is to prove that f(n) only depends on the prime factors of 
n and not their nuultiplicities, Le. 


Ile 
pin 
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Indeed, consider two positive integers a,m and choose b = am. Thus 

f(@m) + f(a(m? + 1)) = Fla) + Flam) 
and by the first condition f(a) = f(a2(m? + 1)) and f(am) > f(a?m). Thus 
both of these inequalities must be equalities and so f(am) = f (a?m) for any 
positive integers a,m. This immediately proves the claim. 

In the second step, we prove that f(n) does not depend on the prime 
factors of n that are congruent to 1 or 2 modulo 4. Indeed, the proof of the 
previous claim shows that for all n and « we have f(n) = f(n(a? +1)). In 
particular, f(n) = f(2n) and so we don’t have to care about possible powers 
of 2 in the prime factorization of n. Also, if p = 1 (mod 4) and z is chosen 
such that plz? +1, then 


f(n) > f(np) > f(alz* +1) = Ff) 


and so f(np) = f(n). In conclusion, we have 


fi)=fl{ J] ep 
pin,pePs 
where P is the set of prime numbers of the form 44+ 3. Let pi, p2,.-. be the 
elements of P3 and define g(A) = f([],e4 Pa)- We obtain a function g defined 
on the set of all subsets of N with integral values. Moreover, we claim that 
9(0) = k, S$) C Sp =+ g(S)) > g(Se) and finally 


981) + g(S2) = 9(S1 U S2) + 9(51 9 Sa) 


The first two relations are obvious. For the third one, note that if the sets 
of prime factors congruent to 3 mod 4 of a and of b are A, B, then the set of 
prime factors of the form 4j +3 of ab is exactly AU B and the set of prime 
factors of the form 4j+3 of a? +b? is ANB. If g({n}) = ky for some integers 
kn < k, then an easy induction on |A| shows that 


(A) = So ha - WAI ~ DE. 
atA 


Conversely, any choice of such k, yields a.corresponding g and the previous 
construction yields a solution f of the equation. This ends the solution. 0 
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The following challenging problem requires some estimates about prime 
numbers that follow from Dirichlet’s theorem, for which we refer the reader to 
addendum 7.A. 


17. Prove that the equation 2? = n!-+1 has only finitely many solutions in 
nonnegative integers. ‘ 


Proof. Suppose that (x,n) is a solution of the equation 28 = n! +1. Then 
nl = (x? —1)(x? + 1)(at + 1). 
Let A,, be the set of prime numbers p <n of the form 4k +3. The key point is 


that ne p € A, can divide 2? + 1 or # + 1, so that proir) must divide x? — 1. 


Thus we obtain 
Yr > 22 -1> Ul pro), 
PEA, 
Then, using that vp(n!) > n/p— 1 (by Legendre’s formula), we obtain 


1 
qrinn> In Wri > ( -1) Inp. 
pEAn 
A classical inequality of Erdés (theorem 3.A.3, chapter 3) yields 


We deduce that 


for any solution (x,n). 


Now, it remains to prove that there are only finitely many such integers 
n. This is a consequence of the proof of Dirichlet’s theorem, which establishes, 
among many other things, that / 


for n > 00, See addendum 7.A for a proof. 


FOE eee PE ee ee TE ee ee ee ee NR Oe ee OM gE ee ge eh pee ag 
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It is really amazing that the following result has a purely elementary 
proof. We present here the beautiful idea of John H-E.Cohn and we refer the 
reader to [18] for other similar results (including the fact that 1 and 144 are 
the only squares in the Fibonacci sequence). 


P18. Let Lo = 2, Ly = 1 and Enya = Lay1 + Ln be Lucas’s famous sequence. 
Then the only n > 1 for which Ly is a perfect square is n = 3. 


Cohn’s theorem. 


Proof, If a and xz are the roots of the polynomial X2 — X ~ 1, then L, = 
2? + 2%, which combined with cyr2 = ~1 yields Lon, = £2 — 2(~1)". This 
already shows that if L, is a perfect square, then n is odd, for y? +2 is never 
a perfect square. 

The case when 7 is odd is much more subtle. There are two key properties 
of the Lucas numbers that make everything work. The first is that Ly 
(mod 4) whenever k is an even number not a multiple of 6 (use the previous 
formula for La, and the fact that L, is odd whenever n is not a multiple of 
3). Call such a number k good. The second key ingredient is the fact that L, 
divides. Ly.42% +n, for all good numbers & and all nonnegative integers n. This 
follows from k = 0 (mod 2), the equality 7,22 = ~1 and the computation 


Enon + Ln = of (at +1) + oR (2 44) = 


apt hak + ah) + af * (ak + ch) = Le Date 


Assume now that n = 1 (mod 4) and n > 1. Then we can write n = 
1+2-3°k for a nonnegative number r and a good number k. Applying the 
second key point 3” times, we deduce that Ly = —L,; = —1 (mod Ly). As 
Lp = 3 (mod 4), it has a prime divisor of the form 47 + 3 and so L,, cannot 
be a perfect square. : 

Assume finally that n = 3 (mod 4) and n > 3. Then we can write 
n = 3+2-3° for a nonnegative number r and a good number k, The 
same argument shows that L, = ~L3 = —4 (mod Lx) and we reach the same 
conclusion, as numbers of the form 2? + 4 have no prime factors of the form 
4j + 3. O 
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4.1 Notes 


Many of the solutions to the problems in this chapter were provided by the 
following people: Alexandru Chirvasitu (problem 14), Daniel Harrer (problems 
2, 8), Benjamin Gunby (problems 1, 6, 9), Fedja Nazarov (problems 12, 15) 
Gjergji Zaimi (problems 5, 7, 13, 16). 
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Chapter 5 


7T5’s Lemma 


All of the following problems fall to a rather handy inequality, known as 
T's lemma even though it is a special case of the Cauchy-Schwarz inequality. 
This result: says that for all real numbers 41,42,...,@, and all positive real 
numbers 2), £2,...,%, the following inequality holds 


2 2 2 
a a a: 
pas oer ee ep & 
@y ee Fy, 


To get the reader familiar with this trick, we start with a series of more 
direct applications. 


1. Let 21,22,...,0n,Y1, Yas - 3°) Yn be positive real numbers such that 


Trt hate + oy 2 Vy + Taye bo + Eryn 


Prove that 
xz @ x; 
Meogt-e tas S34 Sy. 
Yi Ye Yn 


Romeo Tlie, Romania 1999 
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Proof. Using T)’s lemma, we can write 


2 2 
xy a 5 (oa) zi) 
te 
DG ag Sa 
the last inequality a mas the hypothesis. mi 


2. Let a,b,c be nonzero real numbers such that ab + be + ca > 0. Prove 


that: 
ab go ca 


Nhe 


32 
Titu Andreescu 


Proof. The trick is to add 4 to each fraction, in order to exploit the identity 


ab +5 1 ea (a+)? 
e+e 2(a? + 8)" 


With this observation, the inequality becomes 
(a +b)? 
> a? + §2 22 


and it is then a trivial consequence of T)’s lemma, combined with the hypoth- 
esis ab + be + ca > 0, 


3. Prove that for any positive real numbers a, b,c, d satisfying 
ab+be+cd+dea=1, 


the following inequality holds 


3 B 3 & 1 
Senneniennne aevicieetceaay H es 
bee+d + oydea *dtatb atbte 3 


IMO 1990 Shortlist 


Gi 
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Proof. Tz's lemma gives a lower bound 
Sp a ee 
b- oer 2 Sa(b+e+d)’ 


Salo +e+d) = (Xe)’ - Ye? < 3370, 


it remains to prove that 57a? > 1. But again by Cauchy-Schwarz we have 


Since 


1 = (ab+be+cd+da)* < (a? +b? 4c? +d), 


which proves that 57a? > 1 and finishes the solution, 


4. Prove that if the positive real numbers a, b,c satisfy abc = 1, then 


ins, pe SO & >, 
beet]  ctatl | atbsl1 ~ 


Vasile Cartoaje, Gazeta Matematica 


Proof. The solution using T2’s lemma is straightforward: 


@ a (Sa)? 
a, sass Dre +c4+1) es 2M ab+ Via’ 


so that we only need to prove the inequality Se? > Soa. This follows from 


Sa > wat and S> a > 3 (the first being a consequence of Cauchy-Schwarz, 
the becond | being the AM-GM inequality). a 


5. Let a,b,c be real numbers such that 


Prove that 


Titu Andreescu 
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Proof. Observe that 


Lens ee ari =} 


On the other hand, we have 


yes ee 
+i a+ P+ errs 
Combining the two inequalities immediately yields the result. 


The following problems are a bit less straightforward. However, they do 
not require any heavy machinery. 


6. Prove that for any positive real numbers a,b, ¢, 


ae 1 1 2 eee 


a+b? b+e cota’ 2Wabe~ (arbor oeha) 
Titu Andreescu, MOSP 1999 


Proof. The solution using T2's lemma is a bit tricky: the point is to look at 
the denominator of the right-hand side, because 


(a+b)(b+c)et+a) = (at bc? +.(b + cla? + (c+ a)b + abe. 


This suggests writing the left-hand side in the following way 


eo, @ | _ (alep 
e(atb)  at(b+ 37 Pla+e) 2abe 
A direct application of T)’s lemma finishes the proof. Q 


7. Prove that for any positive real numbers a, b,c the following inequality 
holds 


ab + be +e ca 


yet. a? +b +c 


Gabriel Dospinescu 
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Proof. Using T2’s lemma, it is sufficient to prove the inequality 


(7? +P +4)? 3 +R +e? 


(b+? +P(e+a)+latb ~ 4 abt betca 


Note however that the obvious application of Ty’s lemma fails. The previous 
inequality is equivalent to 


4(@? + 0 + ©)(ab + be + ca) > 3(a7(b+ c)? + 0(c + a)? + 2(a +b)%). 
Unfortunately, the only reasonable way to prove this is to expand it in the 
form 3 ' 

24 22> 2 2)2 4 
S* aba +°)> 58 B + zabe S74. 
Fortunately, this is trivial, since 
So abla? + 6?) > 25" a?b? and $7 a?b? > abe Sa. a 


It is possible to solve the following problem using T’s lemma, but the 
proof is not really elegant. In the addendum we discuss some applications o: 
Hélder’s inequality, which makes this problem really easy. 


8. Let a,b, ¢ be positive reals such that abc = 1. Show that 
1 i 1 1 


@(b + 2c)? TF Slop 2a) 7 {a “+ 26)2 = Ee 


Titu Andreescu, USA TST 2010 


Proof. Start by making the substitution 


The inequality becomes 
3 y 2 
east po = 4 mins SS 
(2+ 2y)? — (e+ 22)? ° (y+ 22)? — 3 


mm 
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Applying T)’s lemma, it is enough to prove that 
3(a? + y? +22) > Soa? (z+ 2y)?. 
The right-hand side is equal to 5 )> a*y?+43> x. Thus, we need to prove that 


35-28 +65 (ey)! > 5 (ay)? +450 a. 


Note that 


Lew! = Dew vive 5 Dv Lev? 2 Tew’, 


the first. inequality being Chebyshev’s, the second one by the AM-GM inequal- 
ity and the fact that ryz = 1. Thus, it suffices to prove that 


yo + Vy 2 4y oa. 


8 ; 5 
But 30° + y22? > 42¥ and so it is enough to prove that S727 > Siz. This 
follows from the power-mean inequality and the fact that z+ y+ 2 > 3. 


Proof. As in the previous solution, we reduce the problem to proving the 
following inequality 

BPs ane 8 
+ Qy)2 " (e422)? ° (y+ 22)? 


3 


se 
~ 3 


Using the AM-GM inequality, we can write 
3 


x yt 2 2y+z  & 
(Qy+2)? "27 7 3 


and two similar inequalities. Adding them yields the following estimate 


a : y : 3 yt +yt+z 
(z+2y)? — (w@+2z)? © (y+ 2x)? ~ 9 


and we end up using once more the AM-GM inequality. 
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It is really not easy to prove the following inequality using T)’s lemma, 
but the trick is worth remembering. 


9. Prove that for any positive real numbers a, 6,c the following inequality 
holds 


1 1 i = 1 i Jl 


M.O. Drambe 


Proof. Choose three positive real numbers a, 8, and use T>’s lemma in the 
form 


oe: Bo, (a+ B+ 


_ egret > - 
3aLb * +e 8c4+ a ~ a(3a +7) + b(38 + a) + c(8y + B) 


Now, we impose the conditions 
Saty=2, 38+a=1, 374+8=1. 


Solving this linear system yields the solution a = $ B= hy = g. Therefore, 
we obtain the inequality 

4 1 rt 1 2 1 uk 

=: toe +z: ae : 

7 3a+b 7 3b+e 7 3c+a~ 2a+b+e 


Proceeding in the same way with the two other terms of the left-hand side and 
adding up the resulting inequalities yields the desired result. Qo 


Proof. We can also use the trick of integrating polynomial inequalities to de- 
duce fractional inequalities. Namely, the inequality 


ay + peta > eyelet y+2) 


can be easily proved using T>’s lemma, since it can be written in the form 


2 2 2 
x 
ee pa ty te. 
5 ee ae) 


212 Chapter 5, T2’s Lemma 


Using this inequality, we can write 


petb-1 =e gsbte-} + petal > patbtel + pbteta-t + pretatbet 


a 


Remark 5.1. The technique used in the second proof looks unusual, It is 
actually quite powerful and we refer the reader to [3], chapter 19 for many 
more applications. 

The following two problems are closely related and use a rather useful 
inequality. 


10. Prove that for all x > 4 and all 71, 22,...,%, > 9, 


x, 
Pg Rg ps 


ty tb tq 21+ ay Epi ob Ly 
Tournament of the Towns 1982 


Proof. We start. in the usual way by using T2’s lemma: 


zy 22 : tn 


tit _ > 
Int 0q C1 +2y En-1 +2) 


ta)” 
+t 2n(Pn—1 +a)" 


Since 


21(@_ +22) + ve{e +03) ++ Oy (Ln—1 01) = W(aiwe + eeg+++++4y21), 


it remains to prove the following very useful 


Lemma 5.2. Ifn > 4 and xj,@2,...,G, are nonnegative real numbers, then 


(ay + gt +++ tan)? > daze + 2903 +--+ + Bpr1). 
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The proof is a bit tricky. If n is even, the inequality follows trivially from 
the chain of inequalities 


Away +--+ + &pt1) < Afar + og ++ + tp_a)(a2 +g te tap) 
S$ (11 +29 ++++4+2n)°, 


the first one being obvious and the second one being the AM-GM inequality. 

For n odd, things are subtler, but we can reduce the problem to the 
case when n is even by the following mixing argument: we may assume that 
#1 > £2, so that we trivially have 


Lily + UyH3 + Cglq S T{L2 + 1X3 + 3x4 < 1 (LQ -+ 1g) + (wo + 05) 24 
Thus, replacing 2, £9,...,%, by 1,02 +43,24,...,2n, We preserve the sum 
of the «;’s while not decreasing the quantity xyr2-+ +--+ Ip. Since n—1 is 


even, everything follows from the previous step. ol 


lL. Let n > 4 be an integer and let a1, @2,...,a,, be positive real numbers 
such that af + a3 +--.+a2 = 1. Prove that 


ay ag 
+ ae ee 
atl a+ at = 


aga + aq\/a3 + +++ + Gn Vin). 


5 
Mircea Becheanu and Bogdan Enescu, Romanian TST 2002 


Proof. Applying To’s lemma we obtain 


> a =\- 7 a 2 (Sai Yay? 
yy +1 a Si aF(a?,, +1)" 


(az + it 


Since Sy a? = 1, we have > ata?,, < 1/4 by lemma 5.2. The result follows. 0 


We give two proofs for the following beautiful problem. The first one 
is a standard application of T2’s lemma, the second one uses a very useful 
technique. 
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12. Prove that for any positive real numbers a, 6,c, the following inequality 


holds : b re 


a + - 2h 
Va+8be Vee + Bea) Ve? + Bab 
Hojoo Lee, IMO 2001 


Proof. The most natural application of Ty’s lemma turns out to work very 
smoothly. Indeed, 


(a+b+e)? 


a 
Uyare Laat ava? + 8be 
The only problem is to show that 
So aa? + Bbe < (a+ b+ 0). 
This suggests using Cauchy-Schwarz and, indeed, we have 
(Soa ait be)” < Vso Ve “+ Idabo, 


Thus, the problem is solved if we can prove the inequality 


S a3 + Mabe < (a+b +e)*. 


Fortunately, after expansion, this becomes }7 a(b— ¢)? > 0, which is obvious. 


Proof. Make the substitution 


/ ae - f yt ees ie 
a ey kb 9 VR +8ac’ Voy san 


Multiplying the relation * -Lle= ile and the two similar ones yields the 


relation : F . 
=-1){=-+ oy —1) = 512. 
(2 :) (3 :) (3 ) 
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The problem asks to prove that under this assumption we have z+y+z > 1. 
Assume that this is not the case, so ¢+y+z< 1. Let 


© 
a >e; Y= Eee s Ve > 
E+y+2 Etyt2z ctytZz 


2, 


so that X+Y +Z = 1 and 


ves (8-1) (4-)(B-) 


We deduce that 
512X?¥2z? 
>(X+Y)V¥ 4+Z)(Z4+ X)QX+¥4 ZQY 4X4 Z)QZ4+X+Y), 


which is impossible since X + Y >2/XY, 2X+Y+Z > 4VX°YZ and the 
similar inequalities obtained by cyclic permutations. 


There are two traps in the following problem, making the problem harder 
than it appears at first sight. 


13. Let a,6,¢ be positive real numbers such that ab + be + ca = 3. Prove 
that 


eS ee ae 
Qa+-b? § Wc? § %+a2 ~~ 


T.Q. Anh 
Proof. The inequality we have to establish seems to be opposite to the usual 
applications of T>’s lemma. This is actually not a very serious problem, since 


we can always change the terms of the sum a bit and change the sign of the 
inequality. Indeed, note that 


a ak i= be 
Qa+b? 2 2a + b2 )* 
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Thus, the inequality is equivalent to 
2 nde 2 
BPE +4 oe + — >1 
Qa+b? “ W+e? wre 


And now, another trap: one would be tempted to use 72’s lemma in the 
obvious form, but it is not difficult to check that this does not work. Instead, 
we take advantage of the hypothesis ab + be + ca = 3 to write 


2 
ova, (Eev2) 
Darr Zab + B= 6+ 08° 


Thus, it remains to prove that Sy(ab)*/ 2 > 3, which is immediate by the 
power-mean inequality and the hypothesis. fl 


The following problem is also quite tricky. 


14. Determine the best constant ky such that for all positive real numbers 
4,02, ...,@n Satisfying aya2--- a, = 1, the following inequality holds 
4189 akg : An Gy k 

; yoo so = fess a _ 

(a2 + a9)(a3 + a1) (aj + 43)(0§ + a2) 


"(E+ aia tay) ~ 
Gabriel Dospinescu, Mircea Lascu 
Proof, The basic inequality that will be used is 
(a? +y)(y? +2) > zy(1t+c)(ity). 


‘This reduces immediately to (x? ~ y*)(w — y) 2 0, which is clear. This already 
shows that for n = 2 the maximum value is ky, = 3, since (1+ a2)(1+y) > 4 


if ay = 1. 

Let us assume now that n > 2. We will prove that k, == 7% — 2. To prove 
that ky, > —2, it is enough to exhibit sequences @1,a2,.--,G@n for which the 
value of the expression 


GiQit+1 
F(a, 02,..+5@n La 


2 ae itd Maz, + aj) 


217 


is close to n — 2. This can be done easily, by taking n ~ 1 of the variables 
equal to some x very close to 0, and the last variable equal to z'-", The 
difficult part is proving that F(a1,a2....,@,) <n ~ 2 holds for any sequence 
@,@2,..-,@y a8 in the statement. Using the basic inequality, this reduces to 
proving that 


= 1 
vont em LD 
eS Fa)dt+an,) ~ 


i=l 


for any positive numbers a; with aya2-+-a,, = 1. To prove this, write a; = ste 
iE 


with 2p41 = 21. Subtracting 1 from every fraction, we obtain the equivalent 


inequality 
re BLig1 + Vei.e + xR 
Git ty Rint) (Cigr + ra 


eid. 


This is too siibicabed to try a Cauchy-Schwarz approach, but it simplifies a 
lot if we observe that 


RiCin1 + ViLi42 + ee i 


(zi + Big) (Gin $ 42) ~ 2+ tia 


ai 
se 
Bit Li41 


it remains to prove the inequality 


pre tha ame >] 


© + Tigi) (C41 + Ti+2) 


Since 


T;’s lemma reduces this to the easier inequality 


O ni) 2 Dy a? +2 SS, Lint + eres 


Expanding the left-hand side makes the previous inequality obvious and fin- 
ishes the proof of the fact that ky, =n —2 for n > 3. iia 


The following difficult problem seems to be exactly the opposite of what 
is usually called a standard application of 7o’s lemma. It turns out that we 
can actually apply 72's lemma, but in a very nontrivial way. 
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15. Prove that for any positive real numbers a, b, ¢, 


Qa+b+c)? | Qb+eta)y | Qctat a)? 


- . > <8. 
32+ (B40)? We (c+al? 2? + (a+b? 


‘Titu Andreescu and Zuming Feng, USAMO 2003 


Proof. Writing 


b+e ate a+b 
Beg ew Bo ee 
we can write the inequality as 
(+2)? 2+y? , +2)? 1+20 1+2y 142 25 


7 <8¢ = ta <5 
Spm ) Tey T Tee ROT arya y+ 2° 42> 2 


2 _ 12 ~ p2 
@=1)? , wow = yPd 
a +2 ye+2 zt4+2 ~ 2 
In order to prove the last inequality, we use T2’s lemma in the obvious form 
and so it suffices thus to show that 


Expanding (a + y + 2 — 3)°, we reduce this to proving that 


Sie-wSo2 +43 ay +1220. 


This is not obvious, but the observation that gw? +4 > 4x reduces it to proving 
that 3° cy > 250 2, which is problem 7 in chapter 1. a) 


Proof. This solution uses the linearization method. To simplify computations, 
we may assume (since the inequality is homogeneous) that a+6+c = 1. Define 
the map 


fo) = aya 


The inequality can also be written as f(a)+f(6)+ fle) < 8. We will try to find 
u,v such that f(x) < ux +v for any x € (0, 1], with equality for « = 4 (which 
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is the obvious equality case in the original inequality). Thus, we should have 
f (3) = $40 and also f’ (4) = u, since 4 would be a minimum of ux-+u— f(x) 
on [0,1]. A straightforward, but tedious computation shows that u—= 4 and 
v = 4. We need to prove now that this pair (u,v) really works, ie. that 
f(z) < ux + v holds for any x € [0,1]. Clearing denominators and expanding 


everything yields (after a tedious computation) the equivalent Inequality 
36u° — 15a? —2¢ +1 >0. 


But the conditions we imposed were made so that the left-hand side is divisible 
by (3z ~ 1)?. Doing the euclidean division shows that the left-hand side is 
(3a — 1)?(4x + 1), which is obviously positive. Finally, we just have to add the 
inequalities f(x) < ux + vu for x € {a,b,c} to end the proof. Q 


The following problems are harder than the previous ones. ‘They are still 


based on 74's lemma, but applied in more subtle ways and often combined 
with other tricks. 


16. Let a,b,c, d be positive real numbers such that abcd = 1. Prove that 


op i res fe ee 


>i. 
Gta? aso2 * Geo? Gta? 
Vasile Cartoaje 


Proof. The proof using Ty’s lemma is rather mysterious. By performing the 
substitution 


get Yale we 4 d= 
x y z 
we obtain the equivalent inequality 
2 2 2 2 
x z 
qonemennse oh ot 
(ey)? © (yt+z)? 9 (tt)? © +2) 


Of course, an immediate application of T>’s lemma fails rather badly, so we 
need something more clever. Let us apply J>’s lemma for the first two and 


i 
{ 
{ 
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then for the last two terms of the inequality. If we try to prove the stronger 
inequality 
(2+y)" (2+t? ; 
fe@tyP+ytz? (tt? +GFz)? = 
we easily realize! that it is equivalent to (y+ z)(t{+ a) < (e+ y)(z + 8). 
Unfortunately there is no reason to have (y+ z}(t+«) < (a +y)(z+8). The 
miracle is that if this fails, then we can apply 7)’s lemma for the first and 
fourth terms of the initial inequality and then for the middle terms. In this 
case, we obtain the stronger inequality 
(+t)? @t+z? gy 
(wtyPt(etrt? (yt z+ (ett? 
A similar argument shows that this holds precisely when (@ + y)(z +t) < 
(y-+2)(a+t), in particular whenever (y+ z)(t+2) < (x +y)(z+) fails. The 
result follows. im) 


Proof. This solution is not natural, either, but it is very elegant. We claim 
that for all positive mumbers a,b we have 
1 1 1 
3S = 

(+a Gb? -1+ab 
This is rather easy to prove, though it requires some nasty computations. 
Namely, by clearing denominators and performing the obvious simplifications, 
we reduce the problem to proving that. 


ab(a? — ab +b") —2ab+12>0, 


which is equivalent to 
abla — b)? + (ab~ 1)? > 0. 


Once we have this inequality, we deduce that 


by I > 1 4) u prope 
(+a)? ~ tab" l+ed~ l#ab' 1+ab 


1 is convenient to denote ¢ = ue and b= 72. 
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The following problem is a bit tricky, especially because of the strange 
hypotheses. 


17. Let n > 16 be a positive ieee and suppose that the positive numbers 


@1,G2,-.., Am Satisfy a) + a2 +--+ +an = Land ay +2ag+---+nan = 
Prove that 


(a2 ~ a) V2 + (ag — a2) V3 +--+ (An — ani) Vn < 0. 


Gabriel Dospinescu 


Proof. The first step is to apply Abel’s summation formula to rewrite the 
inequality as 


ay V24 (V3 — v2) ais + anil/n —vn —1) > ann. 
Using the obvious bound a;/2 > a(v2 ~ 1) and the formula 


Vk ~ Ve-1 = 
Vet 7E 
an application of T2’s lemma shows that the left-hand side is greater than 
(a1 +4aa+ yp 


ma vit Vit 1) 


On the other hand, Cauchy-Schwarz together with the hypothesis give the 
estimate 


nl 
Bae < V2 
4=1 
and also 
nal 
> avi+i < V3. 


Therefore, taking into account a Gy +Gg+--++an—-1 = 1— ap, we still need 
to prove the inequality 
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But since 


1 = a) + 2agt...+nan ~ (@) +... +n) > (n~ Ian, 


we have an < fa 7 and so it is enough to prove the previous inequality with a, 


replaced by =+;. But this is immediate for n > 16, which ends the solution. 
Bo 


We end this chapter with two challenging inequalities. The first one uses 
a combination of T>’s lemma and a rather subtle linearization technique. 


18. Prove that for all positive numbers a, 6,¢ the following inequality bolds 


Fea af te, PTS © 
Neo Viera? Vas 


Vo Quoc Ba Can 
Proof. We use T;’s lemma in the form 


fia (Vay 
VSb+e Valset+e)” 


so it remains to prove that 


(Sova) > So Varo. 
Define « = /a,y = Vb,2 = Ve 


Seva ee < (Xz). 


This is actually a very strong inequality and it is easy to see that it resists any 
attempt to prove it using Cauchy-Schwarz (even if its form invites us to use 
such a technique). We will use a linearization technique, by approximating 
first \/8y? +2? by a rational function in y,z. The crucial ingredient is the 


following estimate: 
V 8y? + 2% < By t+2- sue 
od a Qy +z 


so that the inequality becomes 
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The hard point is to figure out that such an inequality holds, since proving it 
is a very easy matter. Note that the right-hand side is clearly positive, so by 
squaring and canceling similar terms we obtain the equivalent inequality 


2 Syz?_. Sye(Byt2) _, 4v?(y— 2)? 
(Qyt+2) ~  Qytez Qy+2)? = 


Using this estimate we conclude: 


So aV/8y2 + 2? < 47 ay - Bxyz 7 STG 


and since 


Day ae 
Qytz2~ et+yte 
it remains to prove that 


9: 
437 ay ~ rer So 2? +25 > ay, 


It is not difficult to check that the last inequality is actually equivalent to 
Schur’s inequality, which finishes the proof of this hard problem. 0 


The next problem requires some preliminaries. We will prove the following 
beautiful discrete variant of a classical inequality of Wirtinger. 


Theorem 5.3. (Fan’s inequality) If 21, x2,...,2p, are real numbers which add 
up to zero, then 


(a? +x} +++ 422) cos > aya + erg +++ +a nz1. 

Proof. We will actually mimic the proof of Wirtinger’s inequality by using 
finite Fourier transforms (for more on this fascinating subject, the reader is 
invited to read the addendum 7.A). Namely, if 21, z2,...,2n is a sequence of 
complex numbers, define 


n 
= 1 _ Dinky 
a4a45>-—>~ ZEE ay 
2 S K 
Th 
vn kel 
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Using the fact that 


if and only if j is not a multiple of n (in which case it equals 1), it is easy to 
check that we can recover our sequence from the sequence of its finite Fourier 
transforms using the inversion formula 


‘We also have the following discrete version of Parseval’s identity: 


nt mn 


So lal? = S$” zel?. 


k=l heos 
Indeed, 


e.g 1 agony hacked 
} el =-5 y hy Mae rs 

n 
k=l 


3 hike 
1 om for —Pimley he) 
ss hey Bhs Se % 
n hy @hp : 
ky ko i 
= 0 mie 
ky=ke 


=>" lz. 
k 


Now, the proof of Fan’s inequality is very easy: write the inequality in the 


form 9 
‘ - On P 
y (te — kai) = (2- 200022) Saf 
k k 


Note that the hypothesis 7) +a@2+++++@, = can be written in the very simple 


in, 
form &», = 0. Now, let? yj = aj ~ aj21 and observe that §j = (1 — en” By, as 
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follows from the definitions. Thus, using Parseval’s identity we obtain 
Lei = Loa? 
j 3 
—~, 2iRi 9), 
= Ste Play? 
j 
ni Qn 
> 2-~ 2cos = ) |a, 
2 (2-200 2) fa 
jal 
Qn 9 
= {2~2cos —~ #,;* 
( COs == ) » |e,] 


and another application of Parseval's identity yields the desired inequality. 0 


2 


We are now ready to prove the following version of Shapiro's inequality. 
The proof is based on a very tricky application of Ty’s lemma combined with 
Fan’s inequality. 


19. Let a, = mud = Prove that for all 21,22,....¢_ € [2.a], the 


mt 


>. 
ae 
Vasile Cértoaje, Gabriel Dospinescu 


Proof. First, we write the inequality in the form 


Note that 
> 1 > Civt t Tie 
Gn 2az 
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Using Ty’s lemma, it suffices to prove that 
1 aon i 2 
7 i = Zi Ti4.2) — Se i Lis : 
(1-3) (Som)” > § (Soavtesss + asa) — soy Det ai) 
A crucial step is to note that we have the identity 


1 2 
SS ae(wins + tine) = So (we + i 41) (tins + Bi42) ~ 5 Sittin)’, 


which allows us to perform the substitution 2; + v4.1 = 2); and reduce the 
problem to proving that 


for nonnegative real numbers 6,;. 
This simplifies drastically if we make another substitution 


A rather tedious, but straightforward computation shows that the previous 
inequality is equivalent to 


1 
(1+) hd 222 aan. 
re 
Of course, we have c; + ¢c2 +++: +e, =0. Finally, note that 
1 Qn 
1+ e = 2c0s a 


Thus, the result follows from Fan’s inequality. sm) 


5.1 Notes 


We thank the following people for providing solutions: Alexandru 
Chirvisitu (problem 16), Xiangyi Huang (problem 14), Michael Rozenberg 
(problem 15), Dusan Sobot (problems 1, 3), Gjergji Zaimi (problems 1, 10, 
11). 


Addendum 5.A Hdlder’s Inequality 
in Action 


The purpose of this addendum is to present some applications of Hélder’s 
inequality, which are quite similar to the problems considered in this chap- 
ter. Of course, Hélder’s inequality is important because of its applications in 
measure theory, probability theory and analysis and not really for the amus- 
ing problems to be discussed here. Actually, we will not even deal with the 
classical version 


a a 
(af tah +--+ ak)e(bf + bh ++ 4.02)9 > aby + agby + +--+ anbn, 


which holds for any positive real numbers a:,5;,p,q such that 2 + 7 = 1, but 
rather with the following: 


Theorem 5.4.1, Let aiz be positive real numbers. Then 


k 
(aj + aja Fos + Ojn) > (Yariaa Gig +++ + Yeindan Gin) *. 
j=l 


Proof. This is a very easy application of the AM-GM inequality. Indeed, just 
add up the following inequalities 


ayi aki 
< pekl 
~ $y / Sy’ 
Sin _ Gen 
Sy Sy ' 
where S; = a3, + aig + +++ + din. QO 


We are now able to obtain a generalization of Ty’s lemma that turns out 
to be very handy in quite a lot of situations when Ty's lemma fails. There are 
of course much more general results than the following one, but our purpose 
is not to delve into the greatest generality. 
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Theorem 5.A.2. Let a1, a@2,...,@, and x1, £2,...,@q be positive real numbers 
and let q > p be two positive integers. Then 


a a a ohio op an > piteng (aa t a2 tor Fan)e 
ay ah ah ~ (ai fg +++ + an)? 


Ve q 4 

al a ay, 

(ay t gb eos + tn)? (3 + = shee “F 
we a ah 


4 q 4 
a as a 
we (ay tee tata) ++ (@p ke + Ep) - (3 +9+--+$) 
2 Dh, 
ah he Sei 
> (ab? +agtt +. + an yP 
and the result follows from the Power Mean inequality. o 


The first theorem is particularly useful when working with sums of square 
(or cubic or ...) roots, which are a nightmare most of the times. Here are a 
few examples that will probably convince you how useful this result is. Most 
of the problems are quite hard to solve by other means. 


A.2, Prove that for all positive real numbers @,b,¢ such that abc = 1 we have 
S74 VB T+ VEET <Uatrbte). 


Vasile Cartoaje 


Proof. This is quite a strong inequality, but the previous results are effective 
in such a context: 


(Vb +74 VET + V8 4 TP 
< (1 +14 I(a+ b+e)((a? + The) + (b? + Tea) + (c? + Tab)) 


and the result follows immediately from the inequality 


abtbe+eac ath 4c?. 
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A.3. For any positive real numbers a,b, ¢ the following inequality holds * 


DEC i cha | a+b % 
Va+be | VP fea’ Ve tab ~ . 


Pham Kim Hung 
Proof. Let S be the left-hand side. Using Hlder’s inequality, we can write 
$3: (Se +e)a? + be)) > 8(a+b+c), 
80 it is enough to prove the stronger inequality 
(a+b+c) > 2576 + c)(a? + be). 
An easy computation shows that this follows from Schur’s inequality. o 


A.4. Let a,b,c be positive reals such that abe = 1. Show that 


i 1 
a se ~ = 
20)?” Blet day? * Slay wp 


1 
eee 1 
a®(b + a 3 


Titu Andreescu, USA TST 2010 


Proof. The substitution 4 = 1,5 = 


i 1 i 
: =z teduces the problem to 


By theorem 5.A.2. 


3 kK 9 
x es ys 5 2 atyte: 
Qy+2)? " Qz+2)? 7 Qz+y)2— 9 


and the result follows from the AM-GM inequality. 
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A.5. Prove that if 21,22,...,2, are positive real numbers with product 1, 
then 


n™(1+a7)(1 + 2§)--- (1+ 28) 
1 1 


1] Ha 
> (ntatetmt ete tz) . 
Ey Ze Ln 


Gabriel Dospinescu 


Proof. This follows easily by adding the inequalities 


i ti aching 1 
/ (ad + A)(ag + 1)..-(@8 + 1) 2 ay + 218 2...2i-1 241. = i+ ee 
% 


obtained from Hélder’s inequality. QO 


A.6. For any positive real numbers a,b,c the following inequality holds 


af a+b atb+e_ a+Vab+ Vaic 
eh Pg 3 


Kiran Kedlaya 


Proof. Using theorem 5.A.1, we obtain 


(a+a+a) (a+ +?) (atb+e) 2 («+ \ 


One concludes by observing that ¢/ 24+) > vab. a 
A.7. Prove that for all real numbers a, b,c we have 


Alta) +b)(4 +7) > (1 +ab + be + ca)’. 


Michael Rozenberg 


> oe 


5.A, Hélder’s Inequality in Action 231 


Proof. This is a pretty tricky application of Hélder’s inequality: 
41422 +P +07)? = (P44 4100? 42 4B? +1) 
-4t1414D? +0441) 
> (1+ ab + be + ca). 


The result follows. 


A.8. Prove that for all positive real numbers a,b,c the following inequality 


holds —. 
abe + Y(a3 4B F(T +1) > ab + be + ca. 
Proof. Using Hiélder’s inequality, we can write 
Yi + VE + (+1) > abse. 
We would like to have 
abe + ab +c > ab+ be + ca, 


which is equivalent to (a ~ 1)(b— 1) > 0. There is no reason for this to hold, 
but at least one of the inequalities (a ~ 1)(b — 1) > 0, (b~ 1)(e~ 1) > 0 and 
(a ~ 1)(e— 1) > 0 holds, as two of the numbers a ~ 1,6 ~1,¢— 1 must have 
the same sign. The result follows. 


A.9. Prove that for all positive real numbers a,b,c the following inequality 
holds 


(a® — a? +.3)(b° — 8 +. 3)(8 ~ 8 +3) > (at b+). 
Titu Andreescu, USAMO 2004 


Proof. Of course, we cannot apply theorem 5.A.1 directly in this case, but the 
form of the inequality is a temptation to find a way to apply theorem 5.A.1. 
The key point is the inequality 


&~a?+3>a°42, 
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which is equivalent to (a? — 1)(a3 — 1) > 0 and thus obvious. Using this, the 
result follows immediately from theorem 5.A.1, by writing 


@®42=04R 40. f 


A.10. Let a,b,¢ be the sides of a triangle. Prove that 


abe [abe __abe Sak bee: 
Vitec * a ate 


f 


Titu Andreescu, Gabriel Dospinescu 
Proof. This is a hard inequality. The point. is to use Schur’s inequality 
(a ~ b)(a ~ c) + (b ~ a)(b— 0) + e(c~ a)(e—b) > 0, 
which can also be written as 
abelatb+e) 2 A(b+e-a)+ B(e+a—b)+(a+b~e) 


with the theorem 5.A.2. Indeed, observe that we can write 


The result follows. 


A.11. Prove that for all a,b,c > 0 we have 
2 2 2\4 3 
(5 + ke + c) > a7 (at +bi +e). 
€ a 


Proof. If X is the square root of the left-hand side, then Hélder’s inequality 


yields : 
X (CP +e + Fa*) > (a? +0 + ey, 
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so that after substituting ¢ = a?,y = 0? and z = ©, it is enough to prove the 
inequality 
(ty + 2) > Way + ys + ze)*(a? +y? + 2). 


But this is immediate from the AM-GM inequality and the identity 
Poy? +2? 4 ay + yz+ 20) = (ety tz), Qo 


A.12. Prove that for all positive real numbers a,b,e,d 


@+t Bei Aad B+1, abed+ i 
+1 +1 24+) @F17>° 7 


Vasile Cartoaje, Gazeta Matematica 


Proof. This is also a very hard problem. Of course, one is again tempted to 
use the theorem, but one needs a trick. The key point is that for all positive 
numbers « we have 


e+) afat+] 
a+i> Vy Pes 


Indeed, using the inequality 
(a? +1)' < (@ +1 (a +1), 
which is just Cauchy-Schwarz, one reduces this to 
2(a? +1)? > (a +1)*(a4 +1), 
which, after division by (a + 1)? and a small computation is equivalent to 
(a~1)*>0. 


Once we have this key inequality, the rest is an immediate application of 
theorem 5.A.1. im) 


Chapter 6 


Some Classical Problems in 
Extremal Graph Theory 


This elementary chapter is a variation on a classical topic in extremal 
graph theory: Turan’s theorem on graphs without cliques. Recall that if G is 
a graph and & is a positive integer, then a k-clique is a set of k vertices, any 
two of which are connected. A graph is called k-free if it does not contain a 
k-clique. We then have the following standard result, 


Theorem 6.1. (Turdn ) The mazimal number of edges in a k-free graph with 
n vertices is 52. sie + G) » where r is the remainder of n when divided 
by k—1. 

In particular, the maximum number of edges in a k-free graph with n 
vertices is at most fod : x which is really the estimate that we will constantly 


use, We start with a series of rather direct applications of these two theorems. 
The other problems are however different in nature and more difficult. 


2 

ii 

1. Let 21,22,...,2,, be real numbers. Prove that there are at most — 
pairs (%, j) € {1,2,...,n}? such that i <j andi < lai — 2;1 < 2. 


MOSP 2001 
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Proof. Consider the graph whose vertices are 1,2,...,7, Connect two vertices 
i,j by an edge if (i,j) satisfies 1 < |x; — 7,| < 2. We claim that this graph 
contains no triangle. If we manage to prove this, the result follows from 
‘Turdn’s theorem. Suppose that the graph contains a triangle, thus we can 
find distinct a,b,c such that 


1<|tqg—ml <2, 1<|ay— tel <2, 1< |ee—aa] < 2. 


By symmetry, we may assume that v_ < rp < x. Then the previous in- 
equalities become ay — ta > 1,%_¢— tp > 1, $0 Lp ~ La > 2, contradicting the 
inequality |x, — tal < 2. The result follows. 5 
ne 
2. Prove that if n points lie on a unit circle, then at most Zz segments 
connecting them have length greater than V2. 
Poland 1997 


Proof. Consider the graph whose vertices are the n points and connect two 
vertices if their distance is greater than /2. The point is that this graph 
contains no 4-clique. This is clear, since a chord of length V2 subtends a 
central angle of §. Thus, by Turén’s theorem the number of vertices is at 


most [| ; finishing the proof. C] 


3. There are 1999 people participating in an exhibition. Out of any 50 
people, at least two do not know each other. Prove that we can find at 
least 41 people who each know at most 1958 other people. 


Taiwan 1999 


Proof. This is a special case of the proof of Zarankiewicz’s jemma. For the 
reader’s convenience, let us recall the proof. Suppose that the conclusion does 
not hold, so we can find at least 1959 people, say Az,...,Aigs9, each having 
at least 1959 friends. Start with A 1. There is a person among Ag,...,A1g9s9 
that knows A). Assuming that we found persons Aj,,..., Aa, (i = 1) among 
Aj,-.-, Atgso, every two knowing each other, let us try to add a new person 
A;,,, to the group. But there are at least #- 1959 —(k -- 1)-1999 persons that 


Ved 
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know Aj,,-..,A;,. Thus, if &- 1959 — (k ~ 1). 1999 > 1, we can always add 
one more person. If k < 48, then there are at least 79 people who know all 
of Aj,,-..,A;, and one of them is among Aj, A2,..., Ajgos. If k = 49, then 
there are at least 39 people who know all of Aj,,-.., Ay, and (although that 
person may not be among Aj, Ag,...,Aigs9) he completes a set of 50 all of 
whom know each other. 0 


4. We are given 5m points in a plane and we connect some of them so that 
10n? +1 segments are drawn. We color these segments in 2 colors. Prove 
that we can find a monochromatic triangle. 


Proof. Note that the corresponding graph (with vertices the 5n points and 
edges between points connected by a segment) contains a 6-clique. Indeed, 
otherwise by Turén’s theorem it has at most (Sn)? : 4 = 10n? edges, which 
contradicts the hypothesis. Now, consider a 6-clique G’ of our graph G. The 
edges of G" are colored in two colors and we claim that there is always a 
monochromatic triangle in G’. This is standard, but we recall the proof. 
Pick a vertex v; of G’. By the pigeonhole principle, we may assume that 
2102, 0303, 0104 have the same color. If one of the edges v2v3, vg04 or Ug04 
has the same color as v2, we are done. Otherwise, the triangle veuge, is 
monochromatic and we win again. CQ 


The following problem does not use Turdn’s theorem, but it is still a very 
classical topic, namely Ramsey’s numbers. 


5. A group of people is called n-balanced if the following two conditions 
are satisfied: 


a) among any three people, there are two who know each other: 


b) among any n people, there are at least two not knowing each other. 


+2) 


Prove that there are always at most { ~« people in an n-balanced 


group. 


Dorel Mihet, Romanian TST 2008 
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Proof. We will prove the result by induction. For n = 2, this is trivial, so 
assume that it holds for n — 1. Consider an n-balanced group and pick an 
arbitrary person P. Let A be the set of friends of P and let B be the set 
of all the other persons in the group. By hypothesis, any two persons in B 
are friends and B has at most n — 1 elements. By induction, A has at most 
f2=2)n+) persons (by b) and the fact that P knows all elements of A, it 
follows that A is an n — I-balanced group). Thus there are at most 


fo MOD 644 y= A= N+) 


persons in the group, which is enough to prove the inductive step. 


The following problem and the method of proof are absolute classics. 


6. Prove that a graph with n vertices and k edges has at least tak —n?) 
triangles. 
APMO 1989 


Proof. Let: #1,22,...,2n be the vertices of the graph G and let d(x) be the 
degree of x. Observe that for a given edge ¢ = 2,2; with endpoints z;, z;, 
there are at least d(a;) + d(x;) — n triangles containing ¢. Indeed, there are 
d(z;) + d(a;) — 2 edges having as endpoints one of x;,x; and another vertex. 
among the n—-2 remaining vertices, so there are at least d(x;)+d(a;}-2—(n—2) 
triangles containing z;,, 2; as vertices. Summing over all edges and taking into 
account that we count three times every triangle in this way, shows that the 
number of triangles is at least (we denote by E(G) the set of edges of G) 


3 (Ala) + (as) ~ 2). 


e=,2j€E(G) 


The previous sum is also equal to 


; = dla? ~ nt) , 
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Applying Cauchy-Schwarz and taking into account that 
ci] 
Sy d(xj) = 2k 
t= 
yields the desired result. a) 


‘The following problem is very similar to the previous one. 


7. A graph with 7 vertices and k edges has no triangles. Prove that we can 
choose a vertex such that the subgraph obtained by deleting this vertex 
Ak 


edges. 


and all its neighbors has at most k ( baie 
n 


USAMO 1995 


Proof. Each edge ay gets killed when we remove x and its neighbors, when 
we remove y and its neighbors, or when we remove any common neighbor of 
« and y and its neighbors. Thus, this edge is killed a total of d(x) + d(y) 
times. Summing over all edges and taking into account that d(2) appears as 
a summand exactly d(x) times, we obtain that the average number of edges 
killed by.removing a vertex and its neighbors is Fy ¥, d(«)*. Since 


we have 


Thus, on average we kill at least ae edges. The result follows. 


8. A graph G has n vertices and contains no complete subgraph with four 
vertices. Prove that G contains at most a triangles. 


Ivan Borsenco, Mathematical Reflections 
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Proof. The result is easy for n = 2,3,4, so let us assume that n > 4 and that 
the result holds for all k <n. We may assume that G contains a triangle ABC. 
Let G, be the subgraph formed by the vertices of G different from A, B,C. 
Since G has no 4-clique, it has at most fea" edges by Turdn’s theorem. By 
the inductive hypothesis, G; has at most (nay triangles. Any other triangle 
in G consists either of a vertex of Gy and an edge of ABC or of an edge of 
Gy and a vertex of ABC. Moreover, any edge of G) forms a triangle with at 
most one vertex of ABC and any vertex of G; forms a triangle with at most 
one edge of ABC, because G contains no 4-clique. Thus G contains at most 


(n-3)  (n nb 
ep 1 7 
a7 + t+n—3+ 7 
establishing the inductive step. Note that the result is optimal if n is a multiple 
of 3, since we can consider the tripartite graph Ky/3,n/3,n/3- O 


It is a standard result that a graph with > 4 vertices and more than 
ntnydacd edges has a four-cycle (see, for instance [3], example 3, chapter 22). 
The following nice problem shows that this result is almost optimal. 


9. a) Let pbea prime. Consider the graph whose vertices are the ordered 
pairs (x,y) with x,y € {0,1,...,.p—1} and whose edges join vertices 
(z,y) and (2',y’) if and only if zx’ + yy’ = 1 (mod p). Prove that 
this graph does not contain a 4-cycle . 
b) Prove that for infinitely many n there is a graph G,, with n vertices 
and at least nyt — n edges that does not contain a 4-cycle. 


Hungary-Israel Competition 2001 


Proof. One can give a rather down-to-earth proof of a), based on explicit com- 
putations, but we prefer the following approach. Consider the F, = Z/pZ- 
vector space V = Fr and the standard inner product ((x,y),(a2’,y)) = 
xa’ + yy’. We are asked to prove that we cannot find four distinct vectors 
U1, 02,03,04 € V such that (uj,v41) = 1 for all 1 < i < 4 (here v5 = wy). 
If such vectors existed, vg and vg would be both orthogonal to the nonzero 
vector v; — v3 and so they would be contained in a line of V. Thus, we can 
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find A € F, such that v4 = Avg. Then A = 1, as (v1, 04) = (v1, 02) = 1. We 
conclude that v2 = v4, a contradiction. 

For the second part, take any prime p and set n = p®. The graph G, in 
the first part of the problem has n vertices, Let us evaluate the number of 
edges. If v = (x,y) is a nonzero vector in V, then the equation (w,v) = 1 has 
p solutions. Thus the number of edges is werd) and it is immediate to check 
that this is greater than nyt -~n= g — p*. The result follows. ia] 


We continue with two very nice problems concerning graphs with n2 + 1 
edges and 2n vertices. Note that this is the first case when Turdn’s theorem 
ensures the existence of a triangle. The following results show that we can do 
better, 


10. Prove that a graph with 2n vertices and n? +1 edges contains two tri- 
angles sharing a common edge. 


Chinese TST 1987 


Proof. We will prove the result by induction. For n = 2, this is trivial. Assume 
now that the result holds for n and consider a graph G with 2n + 2 vertices 
and n* + 2n +2 edges. By Turdn’s theorem and the pigeonhole principle, 
we can find a triangle zyz such that d(x) = d(y) (mod 2), Consider the 2n 
points different from x,y. If there are at least n? + 1 edges among them, we 
are done by the inductive hypothesis. If not, then there are at least 2n + 2 
edges whose endpoints are x or y. But since d(x) = d(y) (mod 2), it follows 
that there are actually at least 2n + 2 edges whose endpoints are x or y and 
which are different from the edge zy. Let A; be the set of vertices » x~ uy 
that are connected to © and let Ag be the set of vertices v x,y that are 
connected to y. Then the previous result implies that |A1{+|Ao| > 2n +2. 
Since |A; U Ag] < 2n, it follows that |A; Ag] > 2 and so we can find two 
distinct vertices z # t that are connected to both x,y. Then the triangles xyz 
and xyt share a common edge and we are done. Oo 


11. A graph has 2n vertices and n? +1 edges. Prove that it contains at least 
n triangles. 
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Proof. Again, the proof is by induction. For n = 2 this is trivial, 80 assume 
that the result holds for n and consider a graph with 2n + 2 vertices and 
n? + 2n +2 edges. By Turdn’s theorem we can find a triangle a)4223. Let 
A; be the set of vertices v # x1, 22,2 that are connected to x;. There are 
obviously at least 


S = |Ay Ag] + Ao As! + 1A3 9 All 


triangles different from x; r223, since any element of A; Aj forms a triangle 
with the vertices ajz;. Thus, if we have $ > n, we are done. Assume that 
S<n-—1. Then, using the inclusion-exclusion principle, it follows that 


Qn 1 > [Ay U Ag U Agl > |Ai| + [Ao] + [Aa] — 241, 


thus one of the numbers |A;| + {Ag}, |Ag] + | Agi, [Ag] + Aa] is smaller than 
2n~ 1, say |Ay}+| Ag]. But this means that there are at least 


n+In+2—-(Qn4+1) =n? +1 


edges among the vertices As, Aq,..-,Aenz2. Thus, by induction we find at 
least n triangles among these vertices and adding the triangle Aj A2A3 yielc 
at léast n +1 triangles. The inductive step is thus proved and the problem 
solved. 


The following problems are more difficult. The next one concerns cover- 
ings of the edges of a graph by cliques. 
Y 12. There are n aborigines on an island. Any two of them are either friends 


or enemies, One day they receive an order saying that all citizens should 
make and wear a necklace with zero or more stones so that 


i) for any pair of friends there exists a color such that each of the two 
persons has a stone of that color; 
ii) for any pair of enemies there does not exist such a color. 


What is the least number of colors of stones required (considering all 
possible relationships between the inhabitants of the island)? 


Belarus 2001 
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Proof. Let G be the graph whose vertices are the aborigines, two of them being 
connected if they are friends. Let B; be the set of aborigines who receive a 
bead of the i-th color. Then by condition ii) each B; forms a clique in G and 
by i) each edge is in one of these cliques. Conversely, if we have a collection 
of k cliques that cover every edge of G, then we can give a bead of color i to 
every aborigine in. the 7-th clique and satisfy the conditions of the order. 

The previous paragraph shows that we need to find the smallest number 
f(n) of cliques that cover the edges of any graph on n vertices G. Clearly 
f(2) = 1 and f(3) = 2. To find an optimal graph, consider a complete 
bipartite graph on n vertices. The key point is that such a graph has no 
triangles, so if we want to cover its edges by cliques, we need at least El (G) 
cliques, where E(G) is the number of edges. Therefore, we have a clear bound 
on f(n), namely f(n) > [=]. 

The hard part is to prove the opposite inequality. We will prove this by 
induction, going from n to n + 2 (which, combined with the first two values 
of f(n), will be enough to conclude). Consider a graph G with n +2 vertices, 
We may assume that at least two vertices are connected, say x,y. Consider 
the graph G; obtained by deleting 2,y and all edges adjacent to these two 
vertices. By induction, we know that we can cover its edges by f(r) cliques. 
If v is a vertex of Gy, consider the subgraph spanned by z,y,v. By adding 
either a triangle or an edge, we can cover all edges of this subgraph incident 
to v. Finally, by adding one more edge to cover the edge zy, we covered all 
edges of G by using at most f(n) +n +1 cliques (by the way, all the cliques 
used were only edges or triangles). Therefore 


f(a+2)< fajtne4el 


Since ae : 
ne n 
[eo] = [=] tnt 
it follows that f(n) < [=]. Therefore the answer is [] i a 


It is not hard to guess the answer of the following problem, but proving 
that this is the correct answer is quite a pain in the neck. 
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13. What is the least number of edges in a connected n-vertex graph such 
that any edge belongs to a triangle? 


Paul Erdés, AMM E 3255 


Proof. Let f(n) be the desired number and let g(n) = [2852]. We will prove 
that f(n) = g(n) for all n. To save words, call good a connected graph in 
which every edge belongs to a triangle. 

We can easily establish that f(m) < g{n) by explicit constructions: if 
n = 2k +1, consider & triangles sharing a common vertex, this being the only 
common point of any two triangles. This is a good graph with 3k edges, so 
f(Qk+1) < 3k = 92k +1). If n = 2k, start from the above good graph with 
order 2k ~ 1, choose an edge AB and add a vertex X and edges AX, BX. We 
obtain a good graph with n vertices and g(n) edges. 

The hard point is proving that any good graph on n, vertices has at least 
g(n) edges. We will prove this by strong induction. For n = 3,4, this is easily 
checked. The crucial observation that makes the induction work is that a good 
graph with n vertices and less than g(n) edges must have a vertex of degree 
9. This is trivial, since by connectedness all vertices have degree at least 1 
and since every edge is in a triangle, every vertex must have degree at least 2. 
However, the sum of the degrees of all vertices is twice the number of edges, 
thus smaller than 37. 

Suppose now that f(j) = g(j) for all j <n and let us prove that f(r) > 
g(n). Suppose G is a connected graph with f(n) edges such that each edge 
belongs to a triangle and suppose f(n) < g(n). There is a vertex a of G of 
degree 2, which by assumption must be in some triangle zyz. Suppose the edge 
yz is in a second triangle. Then the graph G’ obtained from G by removing z 
and the edges xy and xz is connected, has n—1 vertices, every edge belongs to 
a triangle, and f(n) ~ 2 < g(n) —2 < g(n ~ 1) edges. This is a contradiction. 
Thus yz is not contained in another triangle. Form a new graph G” by deleting 
the vertex x and collapsing the vertices y and z into a single new vertex w, 
where a vertex v of G” is joined to w if in G it. was joined to either of y or z. 
Clearly G” is a connected graph with n -~ 2 vertices and every edge of G” is 
contained in a triangle. In forming G”, we lost three edges wy, yz, and xz. We 
would also lose an edge if some vertex v were adjacent to both y and z, but by 
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assumption this does not occur. Thus G” has f(n) - 3 < g(a) —3 = 9(n— 2) 
edges. Again this contradicts the induction hypothesis. Thus we must have 


f(n) 2 a(n). 


The following result is classical and very nice. We give two proofs, the 
first one being an explicit inductive construction, the second one using the 
powerful probabilistic method of Erdés. We refer the reader to the addendum 
6.B for more details on this versatile tool. 


14. Prove that for every n there is a graph with no triangles and whose 
chromatic number is at least n. 


Mycielski’s theorem 


Before starting the proof, let us recall some definitions. A k-proper coloring 
of a graph G is a coloring of its vertices with at most k colors such that each 
vertex receives one color and no two adjacent vertices have the same color. 
The chromatic number of a graph is the smallest number & for which G has a 
proper k-coloring. 


Proof. We will prove this by induction on n. For n = 1 everything is clear, 
so assume that we have a graph G with no triangles and chromatic number 
at least n and let us construct another graph with no triangles and chromatic 
number at least n-+-1. Let x(G) be the chromatic number of G. We may assume 
that x(G) = n (otherwise keep G for the inductive step). Let v2, x sR be 
the vertices of G and consider the following new graph G’: the set of vertices 
consists of 1,,..., v4, together with vertices #1,..., 2%, y (such that the 2k-+1 
vertices thus obtained are distinct). Connect 2; with all neighbors of v; and 
connect y with z,...,a%. Also, keep the edges in G. We claim that G’ is 
triangle free and x(G’) > n+ 1. It is clear by construction that G’ has no 
triangles. Also, we easily have x(G") < x(G) + 1, since any proper n-coloring 
of G extends to a proper n + 1-coloring of G' (assign the same color to Xj as 
to 4; and assign the color n-+ 1 to y). 

The difficult part is proving that this is actually an equality. Suppose 
thus that G’ has chromatic number at most n and take any proper n-coloring 
of G’. We will construct a proper 7 —~ 1-coloring of G, which will contradict 
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the fact that G has chromatic number n. Assume, without loss of generality, 
that y has color n and let A be the set of vertices of G whose color is n and 
for each v; € A change its color with that of 2;. We claim that this gives a 
proper n — 1-coloring of G. Note that no two vertices of A are adjacent, as 
all these vertices have the same color. Now, if v; € A is adjacent to 0; ¢ A, 
then v; is adjacent. to 2;, so that they have different colors. Consequently, we 
found an — 1-proper coloring of G, which is impossible. We deduce that the 
chromatic number of G’ is at least n + 1. 


Proof. We will actually prove a stronger result, which is a classical theorem of 
Erdés. We will use the probabilistic method, for which the reader is referre 
to the addendum 6.B. 


Theorem 6.2. For any positive integers k,n there exists a graph with chro- 
matic number greater than k and such that each cycle has length greater than 
n. 


Take N sufficiently large and consider random graphs with N vertices 
Gy p, each edge appearing independently, with probability p = N°&1 for some 
0<ce< i. If X is the number of cycles of length at most n in Gyp (which 
cycles we will call short in the sequel), then clearly 


since there are at most N* cycles of length i, each appearing with probability 
p’. Since the last quantity is smaller than & for sufficiently large N, it follows 
by Markov’s inequality that 


N 1 
>— |<. 
P(x> y) $3 


Let a(G) be the size of the largest independent set in G (the independence 
number of the graph) and let x(G) be the chromatic number of G. Then it 
is easy to see that x(G) > ats if G has N vertices. Taking a = B InN], 


the probability that o(G) > a is at most (”) .(1— p)() (one can choose a 
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independent vertices in (2) ways and the probability that a given set of @ 
vertices is independent is certainly (1 — p)2)), An easy computation shows 
that the last quantity tends to 0 as N — oo. Thus for N sufficiently large 
we have P(a{G) > a) < 3 and so there is a graph Gpn = G with X < $ 
and a(G) < a. Now, delete one vertex from each short cycle arbitrarily, The 
remaining graph Gy; has at least N/2 vertices, no cycles of length at most, n 
and independence number smaller than «. Thus 
N/2 
SS oes, 
xGi) 2 Siren 7 


for sufficiently large N. The result follows. 


We end this chapter with three gems in extremal graph theory, all taken 
from mathematical contests. 


Ys. For a pair A = (21,4) and B = (x2, y2) of points on the coordinate 
plane, let 
dA, B) = |, — 2| + lyn — yal. 
We call a pair (A, B) of (unordered) points harmonic if 1 < d(A, B) < 2. 


Determine the maximum number of harmonic pairs among 100 points 
in the plane. 


USA TST 2006 


Proof. Consider the graph with vertices on the 100 points and whose edges 
connect points of harmonic pairs. The key point is that this graph contains no 
5-clique. Indeed, if P;(4i, y;) are five vertices, any two of which are connected, 
then for all i 4 j we have 1 < d(P;,P;) < 2. Order these points such that 
a) < #2 <--- < a5, It is easy to see that among the numbers y1, ye,....¥5 
one can find three forming a monotonic sequence. Call these three numbers 
Vix Vig: Yig- Then 


; dP, Pin) + (Pin, Fis) = d( Pi, Pig). 


This is however impossible, since d(P;,, Pig). d(Pis. Piz), Pigs Pi) € (1,21. 
This finishes the proof of the fact that our graph contains no Ks. By Turdn’s 
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theorem, it has at, most ge a == 3750 edges and so there are always at most 
3750 harmonic pairs. 

To finish the proof, it remains to exhibit a configuration having 3750 
harmonic pairs. It is enough to distribute the 100 points near the points 
(0, +4) and (43,0), having 25 points near each of these four points. o 


16. For a finite graph G let f(G) be the number of triangles formed by the 
edges of G and let 9(G) be the number of tetrahedra formed by the edges 
of G. Find the least constant c such that g(G)* < e- f(@)* for any finite 
graph G, 

IMO Shortlist 2004 


Proof. Let us begin by considering the case of a complete graph Ky. Then 


obviously 
f(Qe= G) and g(G) = (). 


o> WKn)> _ 
wiLommnay 
and this happens for all n. Taking the limit as n -+ 00, we deduce that ¢ > %. 
The hard part is to prove that the value 3 actually holds for arbitrary 
graphs G’. To do this, we will first consider the two dimensional version of the 
problem, which is comparing the number of triangles and edges in a graph. 
The reason is simple: computing tetrahedra in a graph G comes down to 
computing triangles in all subgraphs induced by the vertices of G and taking 
the sum of these numbers of triangles (and then dividing by 4, since each 
tetrahedra is counted four times). 
Therefore, consider first any graph G with n vertices 2, 29,---,%, and 
e edges. Let us bound the number of triangles in terms of e. If d(x) is 
the degree of the vertex 2;, then there are at most (4) < day triangles 
containing the vertex 2;, But the number of triangles <oniabning the vertex 
a; is also obviously bounded by the number of edges of G, that is e. Thus for 
each vertex 2;, there are at most d(2,) - J§ triangles containing the vertex 


Thus 
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x;, Summing over é and taking into account that we count each triangle three 
times in this way, we obtain that 


7 é€ 2 fe 
HO<s ii ‘Sate = Zey/S. 


Note that the constant appearing in this estimate in optimal, by taking for 
G = Ky, and then letting n + oo. This already shows that we are on the right 
path, since we solved the two dimensional version of the problem. 

To solve the original problem, all we have to do it to repeat some of the 
arguments in the previous paragraph. Namely, take a graph G with vertices 
£1,..-,0, and fix a vertex x; The number of tetrahedra containing 2; as 
a vertex is the number of triangles in the subgraph induced by the vertices 
connected to z;. Let e; be the number of edges in this subgraph, then the 
previous paragraph shows that there are at most 2e ve tetrahedra containing 
x, as vertex. Also, note that we have 3f(G) = )> e;, since for each vertex x; 
there are ¢; triangles containing x; as vertex (and we count each triangle three 
times this way). However, one still needs an observation to end the proof: we 
gave an estimate for the number of tetrahedra having «; as vertex in terms 
of ¢;, but there is also an obvious estimate: this number is at most f (G). 


Therefore, there are at most 3 2 F(G)e; tetrahedra containing x;. Summing 
over ¢ and taking into account that we count each tetrahedron four times, we 


finally obtain 
49(G) < 5° VK Pa@). 3f(G). 


Taking the cube of the last inequality yields the desired estimate 
Fas S- Ke) 
. ~ 32 
and ends the proof. ga 


¥17. Let & be a positive integer. A graph whose vertex set is the set of positive 
integers does not contain any complete k x k bipartite subgraph. Prove 
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that there are arbitrarily long arithmetic progression of positive integers 
such that no two elements of the progression are joined by an edge in 
this graph. 

KéMaL 


Proof. We will first prove the following result, of independent interest: 


Lemma 6.3. Let a > 2 and let Ka, be the complete bipartite graph with a 
vertices in each half of the partition. There is a constant c > 0 such that a 
graph with n vertices and without Kao has at most cn?~= edges. 


Proof. Let G be such a graph and call 1,2,...,n its vertices. Let dj be the 
degree of vertex i. Let us count pairs (v, {vi,...,va}), where {vi,...,va} is 
a set of a (distinct) vertices, all connected to v. For each v = i, there ate 
precisely (#) sets {vj,..., va} sharing a pair with v. Thus we find 37%., (“) 
pairs in total. On the other hand, for each set of @ vertices {v1,..., vg} there 


are at most a ~- 1 vertices v sharing a pair with {u,...,va}. Thus 
Rg ie 
& (¢) s@-() 
a a 
i=1 


Let ¢ be the number of edges in the graph and let f(x) = @) ife>a-1 
and f(x) = 0 for 0< a <a-—1. Then f is convex and since >\d; = 2q, we 


deduce that * 2 
w-n(2) en) 


This yields the rough estimate 


a 
ant>n(2—a41) c 


from which the result follows immediately. 


Coming back to the proof, fix an integer m > 1 and suppose that among 
any m vertices forming an arithmetic progression, at least two are connected. 
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Fix a large integer N and consider all arithmetic progressions with m terms, 
all among 1,2,..., 4’. There are at least 


such arithmetic progressions, since each is determined by a pair (a, d) of posi- 
tive integers such that a+ (m— 1d < N. This is greater than c(m)N? for N 
sufficiently large, where c(m) > 0 is a constant depending only on m. Since 
each of these progressions contributes an edge to the graph Gy (the subgraph 
of G induced by the vertices 1,2,...,.N ), and since each edge is counted at 
most m? times, we deduce that Gwy has at least oe) N? edges for all suffi- 
ciently large N. Since oe) 2 > cN?-® for all sufficiently large N (and this 
for any constant c > 0), it follows by the previous lemma that Gy contains a 
Ky for all sufficiently large N, Since this contradicts the hypothesis on G, 
the conclusion follows. a) 


6.1 Notes 


We would like to thank the following people for providing solutions to 
some of the problems presented in this chapter: Alon Amit (problem 12), 
Alexandru Chirvasitu (problem 10), Xiangyi Huang (problem 3), Szymon Ku- 
bicius (problem 4), Joel Brewster Lewis (problems 2, 7, 15), Fedja Nazarov 
(problem 14), Richard Stong (problem 13), Gjergji Zaimi (problem 5) 


Addendum 6.A Some Pearls of Extremal 
Graph Theory 


The purpose of this addendum is to present some beautiful theorems 
from extremal graph theory, that nicely complement the more elementary 
results discussed in chapter 6. The main result is the famous Szemerédi-Trotter 
theorem, a deep result in incidence geometry which has a lot of wonderful 
geometric consequences. For instance, it is the basic tool in dealing with 
natural (but very hard) questions such as; if we are given n points in the plane, 
how many distinct distances do they always determine, what is the greatest 
number of segments of length 1 (or triangles of area 1) they determine, etc. 
Thanks to a brilliant observation of Elekes, it also plays an important role in 
additive combinatorics, giving nontrivial bounds on the so-called sum-product 
problem, a major research problem in modern combinatorics. 

The results discussed in this addendum found wide ranges of applications 
and their extensions are a very hot topic of research. See the papers [1], [11], 
[15], [22], [23], [36], [74], [72], [73], [75], [80], [81], and the excellent book 
[82] for more details, as we will only scratch the surface of the subject (but 
hopefully that will be enough to convince the reader of the beauty of these 
results), Also, some proofs in this addendum use the probabilistic method, so 
the reader is invited to take a look at addendum 6.B for details on this method 
and for probabilistic vocabulary. 


6.A.1 The Szemerédi-Trotter theorem 


The famous Szermerédi-Trotter theorem deals with the number of inter- 
sections between geometric objects, giving a fairly nontrivial (and even sharp 
up to absolute constants) upper bound for the number of incidences between 
a family of points and a family of curves. Since we do not want to delve into 
subtle topological considerations, let us agree that curves will always mean 
ares of circles or polygonal lines, as this appears in all applications we have in 
mind. 


Theorem 6.A.1. (Szemerédi-Trotter) Consider n points P,, P2,...,Pn and 
m. curves Ci,C2,...,Cm in the plane. Suppose that C; and C; have at most 
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one common point for alli # j. Then the number of pairs (i, J) such that 
P, € Cj is at most m + 4max(n, (mn)3). 


Tn applications, the following immediate consequence is very handy: 


Corollary 6.A.2, Let S be a set of n points in the plane, L a collection of 
curves such that any two curves have at most one common point, Suppose 
that each curve in L contains at least k > 2 points of S. Then the number 
of pairs of a point in S and a curve in L containing that point is at most 
2°. max (n, gS ; 

Proof. Let | = iL| and let p be the number of such pairs. The theorem 
gives p < 1+ 4n3 max(ni, 13). By hypothesis, we have p > lk, therefore 
p-l> pl ~-£2 §, Hence our inequality becomes p < 8né max(n3, (p/k)3). 
From this the result follows by considering two cases. o 


It is important to note that corollary 6.4.2 {and also Szemerédi-Trotter’s 
theorem) is sharp up to a constant. Suppose that 2k? <n and consider the 
set of points (x,y), where 1 <a < k,l < y S = and x,y are integers. Also, 
consider the set of lines y = mx +6, where m,b are positive integers such that 
lsb<#andl<gm< 3px. It is easy to check that any such line contains 
exactly & such points (namely all points (4, mi+ 6) with 1 <i <k). Moreover, 
there are about n points and about 2 Ines. Also, if k > \/7, one can simply 
pick about n/k lines and put k points on each of them. 

The original proof [81] of theorem 6.A.1 was very intricate. In a beautiful 
paper [80], Székely gave an amazing proof using a graph-theoretic result on 
crossing numbers. We will follow this path, but in order to do that we first 
need some terminology. Let G be a graph and consider an injective map that 
sends vertices of G to points of the plane. Draw an arc (in the plane) between 
any two points that come from the endpoints of an edge of G, such that except 
for the endpoints this arc does not meet any vertex, Call such a map a drawing 
of G. A crossing (or crossing point) is simply the intersection of two arcs not 
at the image of a vertex. Such a point belongs therefore to at least two arcs, 
but it is not an image of a vertex of the graph (with a piece of paper and a 
pencil in front of you, all these definitions should be obvious!). The number 
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of crossings is the total number of crossing points, counted with multiplicities 
(ie. if a crossing point belongs to & arcs, its multiplicity is (8). It is also the 
number of pairs of edges with no common endpoints and. whose associated arcs 
intersect each other. After this dry list of obvious definitions, let us glorify 
one that will be very helpful in what follows: 


Definition 6.A.3. The crossing number of a graph G, denoted ¢(@), is the 
minimal number of crossings over all possible drawings of G. 


Note that, by definition, a graph is planar if and only if its crossing number 
is 0. The key ingredient in Székely’s proof of theorem 6.A.1 is the following 
deep theorem [1], that will be proved in the next section. 


YTheorem 6.A.4, (Ajtai, Chudtal, Newborn, Szemerédi, Leighton) Let G be 
@ simple graph with e edges and n vertices. If e > 4n, then c(G) > ado: 


Let us see why theorem 6.A.4 implies theorem 6.A.1. We may assume that 
every curve C; contains at least one of the points P;, Po, P,,. Consider the 
graph G whose vertices are P;,P2,,..,P, and whose edges connect adjacent 
points on some C; (i.e. two points P;, P, belonging to some Cj, such that 
there is no other point P; between them on C;). Let e be the number of edges 
of G and let J be the number of incidences between points and curves (i.e. 
the number of pairs (7,7) such that P; € C,). As every curve contains at least 
one point among P;, P2,..., Pa, we have e = I ~ m (since if a curve contains 
s points, it yields s ~ 1 edges of G and edges coming from two different curves 
are distinct). Now, since two curves intersect in at most one point, it follows 
that c(G) < m?. If e < dn, we deduce that J < m-+4n and we are done. 
Otherwise, the previous theorem yields m? > a thus e < A(nm) 3 and so 


T<m+ A(nm)3 . The result follows. 


6.A.2 Proof of theorem 6.A.4 and a generalization 


The proof of theorem 6.4.4 is simply beautiful: we will start with a rather 
weak inequality obtained from Euler's formula and, using the probabilistic 
method, we will improve it drastically by averaging. Let us start with the 
weak inequality: 
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Lemma 6.A.5. IfG is a graph with e edges and n vertices, then ¢(G) > e~3n. 


Proof. First, we reduce the proof to the case when G is planar. To do this, 
remove edges of G' which cross other edges until this is no longer possible, so 
you end up with a graph G’ which is planar. Suppose that you removed k 
edges of G, Each of them removes at least one crossing, so that if we accept 
the truth of the lemma for G’, we can write 


Now, assume that G is planar, so c(G) = 0. We need to prove that e < 3n. 
Ifn < 2, this is clear, so suppose that n > 3. Let f be the number of (possibly 
infinite) faces of G. By Euler’s formula we have n+ f = ¢ +2. As every face 
has at least three edges, we have 3f < 2e. We easily deduce that e < 3n — 6, 
finishing the proof of the lemma. 

Finally, let us recall the proof of Buler’s formula: if you start with a single 
vertex and try to reconstruct the graph, then every time you add an edge, you 
either add a vertex or a face, so the quantity n + f — e — 2 does not change. 
Since initially it is obviously zero, it is zero all the time. Oo 


Now, we use the probabilistic method to improve the previous inequality. 
Take an arbitrary number p € (0, 1] and consider a random induced subgraph 
HT of G, by picking each vertex of G independently and with probability p. As 
the probability of a given vertex to be in H is p, by linearity of expectation 
we have Efo(H)| = np, where v(H) is the number of vertices of H. Also, since 
an edge appears in H with probability p?, we have Ele(H)] = pe, where e 
{respectively e(H)) is the number of edges of G (respectively H). 

The previous lemma and linearity of expectation yield 


Ele(H)] > Ble(H)} ~ 3E[v(H)). 


We cannot easily express E[e(H)] in terms of e(G), but we can at least say that 
Ele(H)} < ptc(G). Indeed, take a drawing of G with exactly ¢(G) crossings 
and observe that the probability that a crossing survives in H is p’. The 
conclusion is that 

pic(@) 2 Ele(H)] > pe — anp. 
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As p was arbitrary, it is tempting fe Bees a value that will optimize the 
previous inequality. This value is p = ge , but unfortunately it is not necessarily 


smaller than 1. However, this is far from being a subtle issue: simply choose 
ar 


something a bit smaller, namely p = **, This finishes the proof of the theorem. 

In geometric applications, it is useful to have a more flexible version of 
theorem 6.A.4, which allows graphs with multiple edges. This is the objective 
of the following theorem of Székely [80]. 


Theorem 6.A.6. Let G = (V,E) be @ multigraph with n vertices and € edges. 
If the maximal multiplicity of the edges of G is m, then e < 32nm or c(G) > 
3 


Benim * 


Proof. The idea is to reduce everything to the case of simple graphs, where 
theorem 6.A.4 applies. The details are however a bit involved. For each 
1 <i < loggm+1, let G; = (V,E;) be the subgraph of G using only those 
edges with multiplicity between 2'~! and 2¢ (more precisely, two vertices a,b 
are joined by k edges in G; if they are joined by & edges in G and k € [2*1, 2°)). 
Let e; be the number of edges in G;, without multiplicity. In order to apply 
theorem 6.A.4, we will restrict to those i for which e; > 4n. Let S be the set 
of these 7. As G; has maximal multiplicity at most 2, we have (under the 
assumption that e > 32:n, which will be made from now on) 


1+ [loge (m)} ; é 
SEG) =e-S\EGNze- SO 4n-%Be~i6nm> _ 
ies gS é=1 
We claim that for i € S we have 
AG) 2 64n? 


Indeed, pick an arbitrary drawing of G; and for every pair of vertices of G, with 
a multi-edge between them, select 2’—! of these edges and then arbitrarily and 
independently only one of them. We get a family of (2'~!)% simple graphs, 
each having e; > 4n edges (as i € S) and so with crossing number at least 


ab. Thus, over this whole family of graphs we obtain at least at- De 
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crossings. However, one has to pay attention to the fact that each crossing is 
counted in 2@-0(-2) members of the family, so in total we obtain at least 


meg 3 1,3 
ale Sa ae 
20-1 ~ 64n2 
crossings in Gj, proving the claim. 
Finally, by definition of the crossing number, it is clear that 


©) 2 YG) 2 YG) > 3 mee a 


eo ies 


Now, since 


it is natural to use Hélder’s eer in the form 


(x va) (x ae S a) 24 se 


Combining this with the easy estimate 


I+-floga(m)] 


Soe? s SO a? avon 


ies i=l 
yields 
1 3 3 
CG) > 5 es Se 
KS) 28n2 8 - 32m ~ 26 pF’ 
finishing the. proof. Ct 


6.4.3 An application to additive combinatorics 


In [23] Elekes made a nice connection between theorem 6.A.1 and a famous 
problem of Erdés and Szemerédi, the sum-product problem. This asks for a 


a 
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sharp lower bound of the expression max(|A + A|,|A- Al) over all sets of real 
numbers A with 7 elements. Here 


A+A= {a+bla,b <A} and A- A= {a-bla,b€ A}. 


In particular, they made the very deep conjecture that for any ¢ > 0 there 
exists ce > 0 such that for all sets A with sufficiently many elements we have 
e 


max(|A + A],|A- Al) > ce: [APS 


8 7 
The following result treats the case when 2 ~¢ = 3. In [72], Solymosi proves 
the case 2—¢ = #- 


Theorem 6.A.7. (Elekes) For any finite set of real numbers A we have 
1 45/2 
{A-AL> anf AP? 
[A+ Al-J4- Al > spl 

Proof. Consider P = (A+ A} x {A- A) as a set of points in R. Let L be the 
set of lines I, with equations y = a(x ~ 6) ranging over all choices of siete 
a,b of A. Note that all lines /,,, with a € A ~ {0} and 6 € A are distinct, so 
iI > |A|((A] - 1). Also, ta is incident with (6 + c,a- c) € P, for any c€ A. 
Thus we have at least {Aj - {Z| incidences between L and P. Thus, by the 
Szemerédi-Trotter theorem, we have 


|A] [LE] < [L} +4 max(|PI, (PI L£1)*/9). 
If [P| > |L/’, then clearly 
[P| > |AP?- (Al ~ 1)? = 


and we are done. Otherwise, the previous inequality yields 


3/2. 
riz (4) Viti > Vial © 


o 


and the result follows. 
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The variant of the sum-product problem in which instead of sets of real 
numbers one considers subsets of a finite field has generated a huge amount 
of work. The proofs are in general very technical, as the analogue of the 
Szemerédi-Trotter theorem over finite fields is wrong. One has nevertheless 
the following deep theorem [11]. 


Theorem 6.A.8. (Bourgain, Katz, Tao, Glibichuk, Konyagin) 


Lete > 0. There exist positive constants C,6 such that for any prime p 
and any AC Fy with |Al < p'-© we have 


max(|A + AJ,}4- Al) > CLAIM, 
The analogue of the Erdés-Szemerédi conjecture for finite fields would be 
max(|A + Al, |A- Al) > ¢(e) min(|AP~*, g'-*) 


for subsets A of F, - This is far from being settled. The following theorem can 
be deduced from the sum-product theorem. 


Theorem 6.A.9. (Bourgain, Katz, Tao) Let O<a< 2. There exists ¢>0 


and C > ( such that there are at most Cni~< incidences between n < 


pe 
points and n < p® lines in Fy. 


Note that the previous theorem no longer holds if one considers subsets 
A of Fy, where g is not a prime number. It suffices to consider a nontrivial 
subfield of Fy, for which |A-+ A] = |A] and |A- Al =} A]. Also, it is necessary 
to have some upper bound on |A|, since otherwise A = Fy, would be again a 
counterexample for p large enough. 


6.A.4 Some geometric applications 


In this section, we present some nice geometric applications of Szemerédi- 
Trotter’s theorem. The first result deals with the number of triangles of unit 
area spanned by n points in the plane. It is not difficult to construct examples 
in which there are at least en? log n unit-area triangles, for an absolute constant 
¢ > 0, but it is rather hard to give nontrivial upper bounds for the number 
of such triangles. We will use an easy geometric argument combined with 
corollary 6.A.2 to give such an upper bound. 
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Theorem 6.A.10. There exists an absolute constant c > 0 with the following 

property: the number of triangles of area 1 with vertices among n points in the 
7 

plane is smaller than cn3. 


’ Proof. Let P be aset of n points in the plane. The essential observation is the 


following: if a,6 are points of P, then all points p € P such that the triangle 
abp has area | are on the union of two fixed lines Lp, Lh parallel to ab. — 
is immediate. Now, consider those paits (a, 6) for which the lines LE, and Ly» 


i i at On l/3 
contain at most 23 points of P apiece. Each such pair determines at most 2n. f 


unit-area triangles and so these pairs contribute at most 2n?-nl/3 = an! 3 
unit-area triangles. On the other hand, by corollary 6.A.2, there are O(n*/3) 
incidences between lines | containing at least n!/? points of P and points of 
P. However, given such a line 1, there are O(n) pairs (a,b) such that 1 = Las 
(as for any a and J there are at most two points 6 such that / = Lap). Thus 
pairs (a,b) for which at least one of the lines 1,5 and L{,, contains more than 
n¥/3 points of P also determine O(n-n*/4) = O(n™/*) unit-area triangles. The 
result follows. o 


i i nolat 
Using a lot more work, one can improve the previous result to O(7 (Ate) 


for any > 0 as shown in (22. The exact order of growth of the number of unit- 
area triangles is unknown. If instead one considers the number of triangles 
with perimeter 1, the same kind of reasoning (but working with incidences 
between ellipses and points) yields a bound O(n'®/7). 

The second application is a famous result of Beck. [t is again an easy 
consequence of corollary 6.4.2. 


Theorem 6.A.11. (Beck) For any n points in the plane, either more than 
n-27'4 of them are on a line or these points determine at least 2-n? different 
lines. 


Proof. Let S be a set of n points in the plane. Call a line average if it contains 
between 2!4 and n/2" points of S. We claim that there are at most n?/2 pairs 
of points of S that determine average lines. Indeed, for each 7 € [14, logs 37] 


. ; i, i-1 
there are at most 2° - max (2, =) lines that contain between 2’ and 2° 


points of S, by corollary 6.A.2. Any such line contains at most 4'*" pairs of 
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points of S. Thus the number of pairs of points that lie on average lines is 
bounded by 


2 2 
é. nm nr nv : 
2° S ; 4'tl ax (§-5) <u 3 = + S : gen 
l4cislogs siz 14<i<logyn logy n<éSlogs sf; 


proving the claim. 

So we have at least n”/2 pairs of points of S that do not determine an 
average line. Now, we have two possibilities: either some line contains more 
than n/2™ points of S, in which case the result is proved, or actually the n?/2 
pairs of points determine only lines that contain less than 2!4 points of S. But 
then each such line contains at most 278 pairs of points of S, so there must be 
at least n*/2° such lines. So again the result is proved. Q 


We continue with another rather natural question: what is the maximal 
number of unit-distances spanned by 1 points in the plane? Erdés proved 
the lower bound n'*Petee% and the upper bound cn/? and conjectured that 
the true order of growth should be about n'*®eiE, This is far from being 
proved, but the following theorem gives a better upper bound. In dimension 
3, Clarkson proved that the number of unit distances is O(n2 t= ) for all e > 
0, while Erdés, Pach and Hickerson gave examples with at least cn4/? unit 
distances, Also, note that already in dimension 4 one might have en? unit 
distances, as shown by placing n points on att x2 = 1/2 and n points on 
a+ xi = 1/2, 


Theorem 6.4.12. (Spencer, Szemerédi, Trotter) Let § be a set of n points 
in the plane. Then there are at most 16n‘/? ordered pairs of points (P, Q) in 
S such that PQ=1. 


Proof. Draw a unit circle around each point of § and consider the multigraph 
G with vertex set S and in which P,Q € S are joined if they are adjacent on 
one of these circles. Note that there might be more than one edge between two 
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vertices and that G might have loops (if there are circles containing exactly 
one point of $). If ¢ is the number of unit distances between points in S, 
then G has q edges, for if a circle centered at P & S contains 2; points of 
S, these points contribute x; edges to G. Now, remove all circles containing 
at most two points of S, so we get a new multigraph with at least ¢ — 2n 
edges. We claim that the maximal multiplicity in this multigraph is at most. 
2. Indeed, if there are at least 3 edges between P # Q € S, then there are at 
least three circles containing P,Q, so the circles of radius 1 centered at P,Q 
have at least three common points, a contradiction. Next, for every pair of 
vertices with exactly two edges between them, remove one edge (arbitrarily), 
so that we end up with a simple graph having at see £ ~n edges. Now, if 
q > 10n, then this simple graph has more than max({ 4 4n) edges and so at 
least (¢/4)°/(64n?) crossings by theorern 6.4.4, But since any two circles have 
at most two common points, there cannot be more than 2(5) < n* crossings. 
The conclusion is that if ¢ > 10n, then q* < 4096n4 and the result follows (if 
¢ < 10n, everything is clear). im 


We end this addendum with a rather technical result concerning a famous 
question of Erdés: what is the least number of distinct distances determined 
by 7 points in the plane? Though rather innocent-looking, this is an extremely 
difficult. problem. Erdés gave examples of configurations which determine at 
most en distances, for some absolute pan c. For more than thirty 


years, the best result was Moser’s bound cn?/5, which is a rather tricky, but 
very elementary argument. Note that this bound also follows from the previous 
theorem: there are at least 4 ordered pairs of points and the previous theorem 


{combined with a seedling argument) shows that each distance a spears at 
4/3 3 


most 16n*/° times, thus the number of distinct distances is at least - It was 
only in 1984 that Chung [15] improved Moser’s bound to .en®/?. The next big 
step was done in the paper [16], where bounds of the form her are proved 
by rather difficult arguments. After a huge effort, a major breakthrough was 
made in [36], where Guth and Katz prove that n points always determine at 
least ; distinct distances. Here, we prove a much weaker estimate, but 


ogre n 


which is already highly nontrivial (following [80/). 
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Theorem 6.4.13. (Székely) Any set of n points in the plane determines at 
least cn‘!® distinct distances, for an absolute constant c > 0. 


Proof. From now on c is an absolute constant that will change throughout the 
proof without changing its name. 

Fix rn distinct points P,, P2,...,.P, in the plane and let t be the number 
of distinct distances among them. Let d),do,...,d, be these distances. We 
may assume that t < ni, as otherwise we are done. Around each point. P; 
draw ¢ circles, having radii di, da, ...,d,. Consider now the multigraph whose 
vertices are Pj,..., P, and whose edges are arcs connecting adjacent vertices 
on these circles. Note that there are at least n(n —~ 1) edges in this multigraph 
(for each P;, there are n—- 1 points on the union of the circles centered at P; 
and if a circle contains s points, it contributes s edges). As usual, we remove 
edges that come from circles containing at most two points. It is easy to see 
that this removes at most 2nt edges, so we still have n? + o(n?) edges. 

Unfortunately, the maximal multiplicity might be too big for the previous 
theorems to yield the desired estimate. The crucial point is to show that there 
cannot be too many edges with very high multiplicity. This is the aim of the 
following lemma. Before stating it, call a pair of points k-rich if there are at 
least k edges between them. 


Lemma 6.A.14. There are at most stat +etnlogn edges joining k-rich pairs 
of points. 


Proof. By definition, this number of edges is at most the number of pairs 
(d,e), where d is the symmetry axis of an edge ¢ of G such that d contains at 
least, k vertices. If 2' < /n, there are at most & Se * lines containing at least 2! 
vertices, by corollary 6.4.2. If such a line contains uv points then, by definition 
of t, this line bisects at most 2tu edges. So, as long as we are counting pairs 
(d,e) such that d contains between k and 4\/n vertices, the total number of 
such pairs is at most. 


2 
ina oR 
Lee 


k/2<Bca/a 
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and an easy computation shows that this is bounded by sty On the other 
hand, for each a > 4,/n, it is easy to see that there are at most @ lines 


@ 
containing between a and 2a vertices. These lines yield at. most 


Ss Seegitt < ctnlogn 
aYn<2icn 


pairs. Combining these two observations finishes the proof. B 


Finally, the previous lemma shows that there is an absolute constant ¢1 
such that if we delete all ce; /t-tich edges, the resulting multigraph G) has at 
least. on? edges. As clearly G1 has crossing number at most 2n7?, it follows 


from theorem 6.A.6 that é 


QnPt? > 
~ nla VE 


and the result follows. 


Addendum 6.B_ Probabilities in 
Combinatorics 


This addendum presents some applications of probabilities in combina- 
torics, or what is commonly called the probabilistic method. This is one of 
the most powerful tools in modern combinatorics, even though, just as with 
the pigeonhole principle, the underlying idea is extremely easy. The proba- 
bilistic method was used at first by Erdés, who proved a great deal of fairly 
nontrivial results using rather simple probabilistic arguments. For a much 
deeper study of the method, a canonical reference is the excellent book [2] by 
Alon and Spencer. Many of the examples we are going to discuss are taken 
from this book (however, some of them are left. as exercises in the book and 
are not really easy... ), 

We begin by recalling some useful notions from probability theory. In 
discrete combinatorics, one works with finite probability spaces, which makes 
the discussion much more elementary, avoiding subtle issues from measure 
theory. A finite probability space is the data of a finite set Q (called the 
sample space) and of a map (called the probability distribution) P : Q = {0, 1] 


such that 
So Pw) =1 
WER 


One obvious such map is the constant map ap which is called the uniform 
distribution. One defines for any subset, A of b, which we call an event, its 
probability as 
P(A) = 3° PC). 
ZEA 
It is very easy to check that P(AUB)+ P(ANB) = P(A)+ P(B) and that the 
probability of the complement of A is 1 ~ P(A). Given a probability space, a 
random variable X is simply a map X :Q —> R. The expectation {or mean 
value) of X is 


E[X] = $0 X(w)Pw) = Soa: P(X = 2). 


we #eR 
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Note that the second sum is finite, as the image of X is finite. Another ex- 
tremely important notion is that of independence, Some events Aj, A2,-.., Ax 


are called independent if for all J c {1,2,...,£} we have 
P(MerAs) = [] P(Ad)- 
ie] 


Two random variables X,Y are called independent if the events X = a and 
Y = b are independent for all a,b. It is an easy exercise to check that in this 
case E[XY] = E[X|E/Y]. 

The following theorem consists of two elementary, but incredibly powerful 
principles. The first one is basically the principle of the probabilistic method: 
if you want to show the existence of an object with properties P;,..., P, it 
is enough to prove that there is a probability space (Q, P) such that the sum 
of the probabilities that an object does not have property P; is smaller than 
1. Of course, the difficult point is constructing the good probability space 
(though in practice the construction is naturally imposed by the statement of 
the problem we are trying to solve) and estimating these probabilities: The 
second part of the theorem is called linearity of expectation and it is also a 
very powerful result, as we will soon see. 


Theorem 6.B.1, Let (2, P) be a finite probability space. 


1) For any subsets Aj, Ag,..., Ap of Q we have 


k 
P(E Ai) < 35 P(AD), 


i=] 


with equality if the events are pairwise disjoint. So, if ey P(A) <1, 
then UA; #2. 


2) If X1,Xa,.-.5. X, are random variables, then 


E[X, +Xo+-+- +X] = E[Xy) + E[Xo] +--+ BLXy]. 


Proof. 1) Let By = A, and let B; be the complement of uit Aj in A; for all 


a> 2. Then clearly U;B; = U;A; and the B;’s are pairwise disjoint. But then 
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it ant that P(B;) < P(A;) (as Bj C A; and P takes nonnegative values) 
ani 


P(U;A;) = P(U;B;) = $7 P(B) < > P(A). 
The rest is immediate. 


2) is an easy consequence of the definition of expectation. QO 


: When using the probabilistic method, one is naturally confronted with 
estimating probabilities, which sometimes can be quite painful. A simple 
example is that if (2, P) is a probability space and if Y is a random variable 
on Q, then the definition gives that there exists w € Q such that _X (w) > E[X]. 
The following two inequalities are also very basic, but useful tools in estimating 
probabilities: 


Theorem 6.B.2. 1) (Markov’s inequality) If X is a random variable tak- 
ing nonnegative values and if a > 0, then 


PIX 2als “E(X|. 


2) (Chebyshev’s inequality) Let X be a random variable and leta > 0. Then 
or 1 4 
P(X ~ E[X]| 2 a) < (E[X?] ~ BLXP). 
Proof. For the first part, simply note that 


EX] = $7 P(X = 2)e > SW P(X =2)a = aP(X >a). 


« rea 
For the second part, using Markov's inequality, we can write 


P(X ~ E[X] 2 a) = P(X ~ B[X])? > a) < SBU(X ~ ELXY)7 
; ] 


and an easy computation using linearity of expectation yields the result. 


Q 
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As usual, knowledge comes with practice, so we will spend the remainder 
of this chapter by giving a lot of applications of the previous theorems. We 
start with some applications of the sub-additivity of probabilities. One can 
prove the following inequality using standard techniques, but there is also a 
very elegant probabilistic proof: 


B.2. Let xy € [0,1] for 1 <i,7 <n. Prove that 


n nm 


I (: Th) +I] 1-[JQ-2)} >. 
t=] im] 


jan jel 


Proof. Consider a random binary matrix (a,j) such that Play = 1) = xy 
and all these events are independent. Then []7_, (1 — []i21 ai) is simply the 
probability that each column contains at least a zero, while the expression 

fea (1- Wer a- a%y)) is the probability that each row contains at least a 
1, Since all binary matrices have either at least one zero in every column or 
at least one one in every row, the result follows. 0 


Another application of theorem 6.B.1 is the following nice problem: 


B.3. Let S be a finite set of points in a plane, no three of which are collinear. 
For each convex polygon P with vertices in S, let a(P) be the number 
of vertices of P! and let b(P) be the number of points of 5 lying outside 
P (i.e. outside its interior or border). Prove that for all real numbers 2, 


Soa) (1 — cP) = 1, 
P 


where here the sum is taken over possibly degenerate convex poly- 
gons(polygons with 2, 1, or @ vertices), too 


IMO Shortlist 2006 


7 ‘With the convention that a(P) = 0,1,2 if P is @, a point of S, respectively a segment 
connecting two points of S. 
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Proof. It is enough to check the equality for 2 € (0,1), since a polynomial 
vanishing on (0,1) vanishes everywhere. Consider a random two-coloring of 
S such that the probability that a point is black is x and all these events 
are independent. For a given P, the quantity «%(? (1 ~ xy?) is simply the 
probability that the vertices of P are black and the points outside P are white. 
As these events are disjoint, the sum over all P is the probability that there 
is a polygon with black vertices and such that all points outside it are white. 
But this probability is 1, since the convex hull of the black points is such a 
polygon (note that there might be 0,1 or 2 black points and this is why the 
sum is also taken over degenerate polygons). oO 


We continue with two problemns which use the principle of the probabilis- 
tic method, namely the fact that if the sum of probabilities of some events 
is (strictly) less than 1, then there is a point in the probability space not 
belonging to any of these events. 


B.4. Prove that there is a four-coloring of the set M = {1,2,...,1987} 
such that M contains no monochromatic arithmetic progression with 

10 terms. 
IMO Shortlist 1987 


Proof. Pick a random coloring and observe that the probability that M con- 
tains a monochromatic progression of length at least 10 is bounded by N/4°, 
where N is the number of progressions of length 10 contained in M. So the 
problem is solved if we prove that N < 4°. But there are | #8871 | progressions 
of length 10 contained in Mf and whose first term is i. So 


1987 , : 
—— 1987-i 1 
N< ae em 2 
= 9 < ig 2000 


So, it remains to check that 28-5® < 9.29, which follows from 
58 = (58)? < (Q7)? = gig cg. gl, inl 


Recall that the m-th Ramsey number R(m) is the smallest positive integer 
n such that in any coloring of the edges of Ky, (complete graph with n vertices) 
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with two colors, one can find a monochromatic K,, subgraph. Note that it 
is not obvious that this is well defined, but one can prove without too much 
difficulty that any coloring of the edges of K am-2) with two colors contains a 


monochromatic K,,, so that R(m) is well-defined and at most Goss ae (which 
grows roughly as 4”). The following result of Erdés from 1947 is an absolute 
classic. Amazingly, even after more than 60 years, it remains close to the best 
known lower bound (and 4” remains close to the best known upper bound). 


B.5. if 203)-) > (2), then Rm) > n. In particular, R(m) > 27 for all 
m>a, 


Erdés 


Proof. By definition, saying that R(m) > n is the same as saying that we can 
find a coloring of the edges of K, with no monochromatic K,,. Consider a 
random coloring of the edges of Ky, with two colors, each having probability 
1/2. If S is an m-element subset of the vertices of Ky, let Ag be the event that 
the corresponding subgraph of K,, is monochromatic. We want to prove that 
there is an element in the probability space (ie. a 2-coloring) which belongs 
to none of the events Ag. It is enough to check that 


So P(As) <1 
s 
However, it is clear that we have (") choices of sets S and for each of them 
P(Ag) = 2'-(@), 


Thus, as long as 2G) 5 (2), we can find the desired coloring. The second 
part of the theorem is now easy: taking n =< [2"*/*], we have (for m > 4) 


1) eS ee (n/2y" 27 <8 og 
“ml <Ta ee 


Here is a more delicate problem using similar ideas, but for which it is 
more difficult to find a good probabilistic interpretation. 


e 
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B.6. Let Aj, A2,..., An and By, Bo,..., By be distinct subsets of N such that 
A, 0 B; = 6 for all i and (A; B;) U (4; 0 B;) 4 @ for all i 4 7. Prove 
that for all p < [0,1] 


n 


Sopa — py <1. 


i=1 
Tusza 


Proof. Let X be the union of all A; and B,; and consider a random subset S 
of X such that the events x € S for z € X are independent and of probability 
p (more formally, let 2 be the set of all subsets of X and define 


P(S) = pia — py I-151, 


which is a probability measure on Q, because 3>5-.x P(S) = 1 by the binomial 
theorem). Consider the event Ej: A; C S C X ~ B;. By hypothesis, no two 
events occur at the same time, thus 


Do P(E) = PUB) <1. 


On the other hand, we have 


P(E) = SD pS =p) HIS! = pda — py, 
A;CSCX~By 


the last equality being an easy computation left to the reader. Inserting this 
in the previous inequality yields the desired result. 0 


Remark 6.B.3. If all A; have a elements and all B; have b elements, we deduce 
from Tusza’s inequality (by taking p = ais) that n< (ott 


We continue with a series of applications of the linearity of expectation 
and Chebyshev’s inequality. First, a very simple problem: 
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B.7. Let p,(k) be the number of permutations of {1,2,...,n} having exactly 
& fixed points. Prove that 


¥ kpn(k) = nl. 


k= 
IMO 1987 


Proof. Let o have the uniform distribution on the set of permutations of 
{1,2,...,n}. If X(o) is the number of fixed points of o, then 


+ kpn(k) = EIX], 
” km 


On the other hand, 


where X;(o) = iggyai- Then 


E{X] = Poli) =i) = = 


and the conclusion follows by linearity of expectation. oO 
The next two applications are absolute classics. 


B.8. Any graph with ¢ edges contains a bipartite subgraph with at least ¢/2 
edges. 
Erdés 


Proof. Pick a random subset S of the set V of vertices, by including a vertex 
in S independently with probability 1/2. Let X(S) be the number of edges zy 
for which exactly one of z,y is in S. For a given edge zy, the probability that 
exactly one of x,y is in S' is 1/2, so by linearity of expectation E[X] = ¢/2. 
Thus one can find S$ such that X(S) > ¢/2. By construction, the subgraph 
comprising all edges with exactly one vertex in S is a solution. 


6.B. Probabilities in Combinatorics 273 


For what follows, in a tournament there is exactly one match between each 
pair of players and there is no draw. A Hamiltonian path in a tournament is 
a permutation o of the players such that player o(i) beats player o(i +1) for 
all i. 


B.9. There exists a tournament with n players which lias at least =, Hamil- 


wir = 
tonian paths. 


Szele 


Proof. Pick a random tournament (in which the results of each match occur 
with equal probability), and let X be the number of Hamiltonian paths. If ¢ is 
@ permutation of the players, let X,(T’) be 1 if o induces a Hamiltonian path 
in the tournament T and 0 otherwise. It is clear that, as random variables, we 
have X = 3°, Xq. It is equally clear that E[X,] = 2!-” (since the result of 
the matches between o(i) and (i+ 1) is imposed for all 4). Thus E[X jae gis 
and the result. follows. 


A good exercise for the reader is to prove that the following result im- 
plies (a weak form of) Turén’s famous theorem on graphs without n-complete 
subgraphs. 


B.10. Let dy,d2,...,d, be the degrees of the vertices of a graph. Prove that 
one can find a subset S of vertices such that 


1) S has at least 77, gp elements. 


2) There are no edges between vertices in S. 


Proof. Consider a random permutation My of the vertices of our graph G, all 
permutations having equal probability 4 . Let Ai be the event: o(4) < a(j) 
for any neighbor j of i. We claim that PCA) = on . Indeed, we need to find 
the number of permutations o of the vertices seo that o({i) < o(j) for any 
neighbor j of 7. Ify1,..., es are the neighbors of i, there are ( id, ey) possibilities 
for the set {o(#), oly), .,0(y4,)}, di! ways to permute the elements of this 
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set er not (d;+ 1)!, since o(@) is the smallest element of the set) and finally 
(n — d; ~ 1}! ways to permute the remaining vertices. So 


; n \(n-d-Vid! 1 
P(A) = G + nt ~ &+l' 


as claimed. If X is the random variable X(o) = S77, leea;, then 
nm re 1 
E[X] = 2 FAs) = » ee 


Hence one can find ¢ such that 


n 


It is clear that the set of vertices i such that o € A; satisfies both properties. 
oO 


We continue with a beautiful proof, due to Lubell, of a famous theorem 
of Sperner on maximal anti-chains. 


B.11. Let F be a family of subsets of {1,2,...,} which does not contain two 
elements A,B such that AC B. Then |F| < (fa)- 


Sperner’s theorem 


Proof. Consider a random permutation o of {1,2,...,n} (with uniform dis- 
tribution) and let A; be the event that {o(1),0(2),...,0(#)} € F. The hy- 
pothesis on F implies that the random variable X(o¢) = S77, loca, satisfies 
X(o) < 1 for all o, so that its expectation E[X] < 1. On the other hand, 
EX] = S72, P(A,) and the probability that {o(1),...,0(4)} is a given set 
with 7 elements is Sor thus P(A;) = 7#&, where n; is the number of subsets 
with 7 elements in ¥. Putting this together yields the beautiful inequality 


Ty. 
day Sh 
A) 


t=} 
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from which the result follows, as (8 (g ) is the largest among the binomial coef- 
ficients. 


Actually, the proof shows the following more general result, known as 
the Lubell-Yamamoto-Meshalkin-inequality: let. Ay, A2,...,A, be subsets of 
{1,2,...,n} such that no A; is a subset of any Aj, Then 


I 1 
ae <h 
(Au) ‘Tt (Au 


The following famous theorem of Bollobés generalizes this result. further: 


B.12. Let Ay, Ag,...,An and By, Bg,..., By be distinct sets of positive integers 
such that A; n B, = @ for all i, but A; 0 B; #60 for alli # j. Then 


n 


1 
oe TRE: oi) $i 


t=1 
Bollobds 


Proof. We may assume that all Aj;’s and B;’s are subsets of {1,2,. yt} 
Consider the uniform distribution on the set Sy of all permutations of 
{1,2,. -,N}. Let £; be the event consisting of all permutations o with the 
following property: all elements of A; come before all elements of B; in the 
list o(1),(2),...,0(n). It is easy to see that the hypothesis implies that the 
events E; are pairwise disjoint. Thus 5°, P(Z;) < 1. It remains to notice 
that P( Ej) = (ahi ae 4, Bi ay? as any subset with |A,| elements of A; U B; is equally 


likely to form the first [44] elements in the list o(1),¢(2),...,¢(n). The result 
follows. a 


We take a break here to discuss a very nice consequence of Bollobds’ 
inequality. Recall that an r-uniform hypergraph is a pair (V,E), where V is a 
set and E is a collection of subsets of V, each subset having r elements, Let 
K? be the complete r-uniform hypergraph on ¢ vertices, 
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B.13, Let G = (V, Z) be a r-uniform bypergraph on n vertices. Suppose that 
G contains no Kj, but that if we add any r-element set to EZ, at least 
one Af appears. Then 


e()-C89) 


Bollobas 


Proof. Let [n] = {1,2,...,n} and let [n] be the collection of r-element 
subsets of [n]. If A € {n] — B, consider a copy C(A) of AT that appears 
by adding A to E. Then A & C(A), yet A’ is not in C(A) if A! # A is in 
in|) — EB. So, if {Ay,... Am} = [rn] ~ E and B; is the complement of C(A;), 
then (A;); and (Bj); satisfy the hypotheses of Bollobés’ inequality, yielding 
m< ("-'*), The result follows. 


We continue with a very nice result from additive number theory. 


B.i4. Let A C Z/n?Z be a subset with n elements. Prove that there exists a 
2 
subset BC Z/n®Z with n elements such that |A+ Bi > 3. 


IMO Shortlist 1999 


Proof. Pick a random collection of n elements of Z/n*Z, each of the n elements 
being taken with probability 1/n? and all choices being independent. Consol- 
idate the distinct elements among the n chosen ones in a set B, which may 
have less than n elements. Consider then the random variable X = |A+ Bl. 
As 

X= > lica+B, 


iCZ/n?h 
we have by linearity of expectation 


E[X]= S> PGE A+B). 
i€Z/n2Z 
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On the other hand, the probability that i ¢ A+ B is clearly the n-th power 
of the probability that a given integer is not in A, that is 


PGE A+B) =1-(1- ay" i-(1 i)" 


We deduce that a 
E[X] =n? (i- (-2) ) 
nm 


and the result follows from the inequality (1~ 1)" < }. O 


Often, the choices of the probability space and distribution are imposed 
by common sense. However, in order to get better quantitative results, one 
uses sometimes less natural probability distributions. The following problems 
illustrate this: 


B.15. Let V be a set with 7 elements and let F' be a family of m subsets of 
V, each having three elements. Prove that if 3m > , then there exists 
Sc V having at least 28 elements and such that no element of F 
is contained in S. 


Proof. Choose p € [0,1] and pick a random subset S of V such that P(v € 
S) = p for all v € V, all the events being independent. If n(S) is the number 
of elements of F contained in S, then linearity of expectation yields 


E||S| ~ n($)] = El|S{] ~ E[n(S)] = np — mp*. 


This is maximal for p = \/¢= € [0,1] and is equal to 22/2. Thus, we can 
find § such that [5] ~ n(S) > a 3. Choose such S. For each e € F with 
all of its three elements in S, delete arbitrarily one of these three elements. 
We thus end up with at least |S] —n(S) elements of V which obviously have 


the desired property. a 


B.16. ‘The minimal degree of the vertices of a graph G with n vertices is d > 1. 
Prove that there exists a subset S of the vertices such that 


a 
1) [S| <n - 
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2) Any vertex of G is either in S or neighbor of a vertex in S, 


Noga Alon 


Proof. Consider a random subset S of the set of vertices V such that each 
vertex is in S independently with probability p = nee if 9’ is the set of 
vertices which do not belong to S and which have no neighbor in S$, then SUS’ 
satisfies the second condition, It is thus enough to show that 


BIS US] <n. AtREt 


However, 
EIS US" < Ells] + BS" 
= S> P(e 8) +> Pw 8’) 


<Spntn(i— py 
ee itin(d+1) 
< dei" 
the last inequality being equivalent (after dividing by n, canceling p and taking 
logarithms) to log(1 — p) < ~p, which is well-known. oO 


A very beautiful but rather subtle application of the probabilistic method 
is the following gem due to Erdés: 


B.17. Let A be a finite set of nonzero integers. Then A contains a subset B 
of cardinality greater than ‘Al such that the equation x + y = z has no 
solutions in B x Bx B,. 


Erdés 


Proof. The proof is quite tricky. Choose a prime p = 3k +2 large enough, say 
such that p > 2\e| for all a € A. The subset S = {k+1,k+2,...,2k +1} 
of F, is a sum-free subset, ie. the equation ¢+y = z has no solutions with 
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@yz2€S. Pick re Fy, randomly with uniform probability and consider the 
set 

A, ={a@€ Ajax (mod p) € S}. 
This is easily seen to be a sum-free subset of A. To prove that one can choose 


# such that |A,| > lat it is enough to compute the expected value of |A;|. 
Linearity of expectation shows that 


EllAcl] = 5° Plax (mod p) € S) 
aca 


and it is clear that for any a € A we have sca mod p) € S) = # as 


(a2)zers is a permutation of F5. Thus E{|A.|] > # {Al and we aredone. O 


We end this chapter with a very beautiful application of Chebyshev’s 
inequality. Before attacking this problem, let us recall a few basic properties 
of the variance. Note that for any constant ¢ and any random variable X we 
have Var(cX) = c?Var(X). Also, it is not difficult to check that if X. sY are 
independent variables, then Var(X + Y) = Var(X) + Var(Y). Finally if X is 
0 with probability p and 1 with probability 1 — p, then Var(X) = p(1 ~ p). 


B.18. Let v;,v2,...,¥,_ be n vectors in the plane, whose coordinates are integers 
of absolute value less that Wa Vi R. Prove that there are disjoint subsets 
I, J of {1,2,...,n} such that 


Saeed es 


tel jes 


Proof. Note first of all that it is enough to find two distinct such subsets J aoe 
for then it is enough to take out of each the common elements of J, J. Now, 
assume that we cannot find two such distinct subsets and write vj = (21, yi). 
Consider a random binary sequence (a;, a2,...,an), each a, being 0 or 1 with 
probability 1/2 (all these events being independent). Consider the randorn 
variables X = pes 1 GE; and Y = Dont ayy;. The point is that the values 
taken by (X,Y) are all different and that a lot of values are concentrated near 
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the expectation of (X,Y). To be more precise, let m, = E[X], my = ElY] 
and observe that the discussion preceding the problem yields 


v4 n 
Ns ole a Cal eee 
Var(X) = Var oy aizi) = Ls 1 < 70000" 
by hypothesis. Let 0? = 275. Using Chebyshev’s inequality, we obtain 


P(X = mz| 220) <} 


Sy PY —my| 2 20) < 


1 
, 


hence 


1 
2 
This means that at least half of the (pairwise distinct) points (X, Y) are located 
in the square |X —m,| < 20, [Y ~my| < 2c. But this square contains at 
most (1 + 40)* laitice points, so 


P(|X — mz < 20,|¥ ~ my| < 20) > 


(14 40)? > 9777, 


which is easily seen to be impossible (as say 250% > (1 + 40)* for n > 16) 


unless r = 1, in which case we are easily done. a 


Chapter 7 


Complex Combinatorics 


There is a class of combinatorial problems, most of the times with number- 
theoretic flavor, which have very elegant solutions using finite Fourier trans- 
forms or versions of it. The main observation is the identity 


k-1 


1 j(n— 
Tnze (mod &) * yo zie, 
7 


which holds for any primitive root of unity z of order k. This gives a rather 
powerful approach te problems concerning the distribution mod & of the sums 
of subsets of a given set or to tiling problems. Actually, this relation is a 
very special case of a much broader theory, that of representations of finite 
groups. The case of finite abelian groups is particularly easy and useful in 
combinatorial problems and we refer the reader to addendum 7.A for a more 
detailed discussion. For the next problems the previous relation is actually 
sufficient. 


7.1 Tiling and coloring problems 


We start with two tiling problems which have rather elegant solutions 
using complex numbers. Of course, the idea is similar to the usual coloring 
method, but we think it is neater. 
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1. Let & be an integer greater than 2. For which odd positive integers n 
can we tile an x n table by 1 x & or & x I rectangles such that only the 
central unit square is uncovered? 


Gabriel Dospinescu 


Proof. Assign coordinates to the squares of the table, with (0,0) assigned to 
the upper left corner. Put the number 2**+¥ in square (a,y), where z is a 
primitive k-th root of unity. Then all 1 x k and k x 1 tiles will cover a total 
value of 0. Thus the total value of the whole board must be 0. On the other 
hand, this total value is precisely ( oot #) —2z"-l_ Thus we must have 


(oa 2)’ ~ 2") = 0 so for some ¢ € {-1,1} we have 25) = ¢-2"F". This 
can be also written as (275 - eat +¢) = 0, which immediately implies 
that. n is congruent to 1 er —1 (mod k). The converse is easily seen to hold, 
for if n = 1 (mod &) then we can tile both the center column and the center 
tow {excluding the center square) using the rectangles, leaving four rectangles 
of dimensions multiples of k. And if n = —1 (mod &), then we can split the 
table into four rectangles each with a dimension a multiple of k and so again 
the tiling is possible. ‘Therefore the answer to the problem is: all n for which 
n = +1 (mod k). 


2. Let n > 2 be an integer. At each point (7,7) having integer coordinates 
we write the number i+ 7 (mod n). Find all pairs (a,b) of positive 
integers such that any residue modulo n appears the same number of 
times on the sides (except for the vertices) of the rectangle with vertices 
(0,0), (a,0)}, (a,8), (0,6) and also any residue modulo n appears the 
same number of times in the interior of this rectangle. 


Bulgaria 2001 


Proof. First, let us get rid of the easy case n = 2. As the interior of the 
rectangle must have an even number of lattice points, clearly one of a, 6 must 
be odd. But if one of a,b is odd, then clearly all conditions are satisfied, so 
that in this case the solutions are all pairs (a,b) with a, not simultaneously 
even. 
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Now, let us treat the more difficult case n > 2. Let z be & primitive root of 
unity of order 2 and put the number 2‘¥) in the square (4,3). By hypothesis, 
the sum of the numbers associated to the squares of the rectangle is 0, so that 

aot sréct 247 = 0. But this factors as 


j=l Lage 


so that necessarily one of a,b is congruent to 1 modulo n. By symmetry, we 
may assume that a = 1 (mod n). Now, since every residue modulo n appears 
the same number of times on the sides of the rectangle, we also have 


b-1 a-t 
(24 +1) Se + (2°41) (= ‘ ary, 
jet i=l 
As a= 1 (mod n), the previous equality becomes (z+ 1) ( ra ca ) = 0. But 
n > 2, so that +1 0 and so we must have payee 2? =Oandb=1 {mod n). 
Since it is clear that for @ = b = 1 (mod n) all conditions are satisfied, it 
follows that this is the solution if n > 2. Ea 


The next two problems have very beautiful solutions using the methods 
of this chapter and some classical algebraic identities. 


¥ 3. Each element of M = {1,2,...,n} is colored in either red, blue or yellow. 
Let A be the set of triples (z,y,z) € M x M x M such that n divides 
a+y+z and z,y,z have the same color. Let B be the set of triples 
(x,y,z) € M x M x M such that n divides x + y+ 2 and z,y,z have 
pairwise different colors. Prove that 2|A] > |B]. 


Chinese TST 2010 


Proof. For simplicity write 1,2,3 for the colors and let 


f(X)= SO x 


ie Mcli)=j 
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where c(i) = j if i has color j. The identity 


k-1 
1 ai 
Inze (mod k) = pos zi(n @) 
j=0 
yields 
3 eo! Gee yo 7 Zink, 
Al = s 5 ~Sre 28 (aytz) . SS file) 
j=l ela)=cly)=czj=j | k=0 k=O jel 
and 
6 eh ink 
iBl=— dT file. 
k=O j=] 


Of course, this method has some limitations, because it is rather difficult 
to compare complex numbers. The miracle is in the formula 


uniess k = 0, when it equals n. 
Putting these observations together, we deduce that 


244i — |B] = (AG) - A)? + (AG) ~ AM)? +(AD-AMP20 a 


‘The following probiem is very similar, but more technically involved. 


4. Color the numbers 1,2,...,N using 3 colors such that there are at most 
x numbers of each color. Let A be the set of 4tuples (a,b,c,d) € 
{1,2,...,N}4 such that 2-+b+¢+d = 0 (mod N) and a,6,c,d have the 
same color. Let B be the set of 4-tuples (a, b,¢,d) € {1,2,..., N}4 such 
that a+6+c+d=0 (mod N), a,b and ¢,d have the same color, but 
these colors are distinct. Prove that |A| < |B}. 


K6MaL 
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Proof. Let us write z, = e'W* and e(j) for the color of number j. Define 
fj (X) = Coals X*. As in the previous solution, one ends up with the 
formula 


, Net 
[B] — [A] = x >> (ft (24)? fo(2n)? + 2fo( za)? fa(2n)? + 2fa(zn)* fa (zx)? 
kad 


~ fi(aie)* ~ folzn)* — fo(zn)4). 
Readers familiar with Heron’s formula have already noticed that 
2(a*b? + be? + a?) — (a4 +644 cy 
=(at+b+e)(b+c-a)(c+a~b)(at+b—c). 
Also, as in the previous solution we have 


N 


Silex) + folzn) + falex) = S78 


a=1 
and this is nonzero if and only if & = 0, in which case it equals N. The result 
follows from these remarks and the hypothesis, which ensures that 
SiG) + fold) > fa) 


and similar inequalities. 0 


7.2 Counting problems 
In this section we combine the technique of generating functions with the 
fundamental relation : 


k-1 


1 ‘ 
lnsa (mod &) * pe gees 
5x0 


This mixture is very powerful and applies very well for counting problems with 
arithmetical flavor, as roots of unity can detect congruences very efficiently. 


i 
if 
i 
i 
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5. Three persons A, B,C’ play the following game: a subset with k elements 
of the set {1,2,..., 1986} is selected randomly, all selections having the 
same probability. The winner is A,B, or C, according to whether the 
sum of the elements of the selected subset is congruent to 0, 1, or 2 
modulo 3. Find all values of & for which A, B,C have equal chances of 
winning. 


IMO 1987 Shortlist 


Proof. Let. z be a primitive third root of unity and consider the polynomial 


1986 
PCX) = (1+ X2)(L+ X27). (1+ X21) = Say x4, 
20 


Note that 
P(X) = y 2D xi 
IC {1,2,..., 1986} 


where m(J) is the sum of elements of J. Thus 


a= So 2m) = » 1+ S- i) > :) zt 
()-1 [i=k,3lm(}-2 


Wl=3 Ul=3,3}m(2) HU |=k,Strm 


and this is zero if and only if 


ee Se ee 


|= 7.3im(7) Hl=j,dlm{2)—-1 H1=7,.3)m(2)~2 


Therefore, the problem asks precisely for those k such that ap = 0. Now, 
fortunately the polynomial P has a very simple form, since 


P(X) = (1+ X2)(L+ Xz)(1 + X27)? = (1 4 x 3)602, 


so the nonzero coefficients are precisely those whose index is a multiple of 3. 
Thus the answer to the problem is: those k = 1,2 (mod 8). O 
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We present two solutions for the following problem: the first one is the 
standard proof using complex numbers, while the second one is a very elegant 
probabilistic proof, 


6. We roll a regular die n times. What is the probability that the sum of 
the numbers shown is a multiple of 5? 


IMC 1999 


Proof. We need to count the number of sequences (x1,@2,...,2n) with 1 < 
x; < 6 and such that 5 divides a; + 29 +-.. 4 Zn. Let a; be the number of 
sequences (21, £2,...,2,) such that 1 < 2; < 6 and M+ ea +--+ a, =F 
(mod 5). Then for each fifth root of unity z we have 


ag + az +++ + agzt = > griteete tie x (z 4 22 4... 4 28), 
1521 ynytn <6 


This equals z" unless z = 1, in which case it equals 6”. Adding these relations 
for all fifth roots of unity z gives 5ag = 6" + 2% + 22" 4 23m 4 24", for some 
primitive root of unity of order 5. If n is a multiple of 5, 27 4+--.4+ 4" a4 
and the probability is $2 = £34. If not, then 


2 on 
Pa I id a =—], 
so that the probability is aah a 


Proof. We use a simple fact from probability. Suppose X is an integer-valued 
random variable which is uniformly distributed mod m (that is, takes on every 
value mod m with probability 1/m) and suppose Y is any other integer-valued 
random variable independent of X. Then X +Y is also uniformly distributed 
mod m. 

To use this we break a roll of a die into two steps. We first flip a coin 
with a 5/6 probability of giving a heads. If we get a heads, then we roll a fair 
5-sided die with the numbers 1, 2,3, 4,5 on it. If we get a tails, then we just get 
the number 6. If any one of our n coin flips gives a heads (which occurs with 
probability 1— (4), then one of our summands is uniformly distributed mod 
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5 and hence the sum is uniformly distributed mod 5. Otherwise, all the coin 
flips were tails and hence all the dice were 6’s, so that the total i a8 n {mod 5). 
Thus the probability of getting a sum which is a | multiple of 5is 21 (4)") 
for n not a multiple of 5 and 2 (1 ~ (4)") + (4)” for na multiple of5. 


The following problem is a good opportunity to recall a very useful result: 
if p is a prime and if ag, a},. +++5Q@p—4 are rational numbers such that ag + 
@12 + +++ + Gp..12?7) = 0 for some primitive root z of order p of unity, then 
a = @, = +-+ = G@p_y, This is an immediate consequence of the irreducibility 
of +X +---+X?-? over the rational numbers. 


7. Let p > 2 be a prime. How many subsets of ey are 
sum of their elements divisible by p? 


p— 1} have the 


Ivan Landjev, Bulgaria TST 2006 
Proof, Let z be a primitive root: of order p of unity and consider the sum 


S= > gt) 
AC{1,2,...p-1} 


where m(A) = }7,¢4@. If x; is the number of subsets A C {1,2,...,.p—1} 
such that m(A) = j (mod p), then clearly 


Ssagt aye + aez* 4-04 Gp_y2P). 
On the other hand, we can explicitly compute S, since 


S= Tla+)= TIa+9, 


i=l ¢ 
the product being taken over all roots ¢ of the polynomial 


‘We deduce that 
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So ag ~ 1+ aiz+-+++ ap_12z?-+ = 0, which implies that zp — 1 =a) =--. = 
Zp-1 = k for some k. Since 2p + 21 +--+ + xp_zis simply the number of 
subsets of {1,2,. ~ 1}, that is 2?-1, we deduce that kp + 1 = 27! and go 
eee) %q = 1+, the problem is solved. Note that. we included 
the empty set when counting zg (by convention the sum of the elements of the 
empty set: is zero). O 


The following problem is technically more involved, but uses the same 
ideas as before. The special case when n is a prime was problem 6 of IMO 
1996. 


8. Prove that the number of subsets with n elements of {1,2,..., 2} whose 


sum is a multiple of n is 
= S-0% (3) (7): 


din 
Proof. Consider the polynomial 
F(X,Y) = (1+ XY\1+ X°Y)---(14 XY). 


If m(A) is the sum of the elements of A, we also have 


an 
FOGY) = STF SO xe | ve, 
a=0 \[Al=a 


where all sets A in this solution are subsets of {1,2,...,2n}. Taking for X 
ix, 
the roots of unity of order n, say z; = € “a”, the fundamental relation implies 


that 
an 


L f(a, ¥) 4-04 fem ¥)) = Sota ¥*, 


a=} 


where x, is the number of subsets A with a elements and such that m({A) #0 
(mod n). We deduce that 2,, which is the number of sets A we are trying to 
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find, is also the coefficient of Y” in the polynomial 4(f (21, ¥)+--++f(zn.Y)). 
Now, fix j and let us compute 


fle. ¥) = (1+ 4¥)(14 BY) (1+ 22" ¥). 
We can write j = du,n = dv, where d = ged(n,j) and then we clearly have 
fe.¥)= (4 Y)1 +e FY). (14 ey) = 1 = (=) 


So the coefficient of Y" is (-1)4+" (*4). Since for any din there are exactly 
y(n/d) integers 1 < 7 < 2n such that ged(j,n) = d, we deduce that the 
coefficient of Y" in 4(f(a,¥) +++: + f(z, ¥)) is exactly 


EP -n4e(3) (4), 


which finishes the solution. a 


We continue with a very nice problem concerning the number of solutions 
mod p of a quadratic equation in several variables. Actually, the same methods 
also apply in some other situations, but the general problem of estimating the 
number of solutions mod p of systems of polynomial equations mod p is very 
deep. See the addendum of this chapter for more details. 


9. Let p be an odd prime. Find the number of 6-tuples (a, 6,c,d,e, f) of 
integers between 0 and p — 1 such that 


40+ zd? +e7 +f? (mod p). 
MOSP 1997 


Proof. Let z be a primitive root of order p of unity. Since Bay gh = Q if x 
is not a multiple of p and equals p otherwise, the desired number of 6-tuples 
is 


1 
1 a 2g 242m d2—e2 — f? 
S=a- phate bP dP me? f?)_ 
ee? 
a,b,c,d,e,fEZ/pL k=O 
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Note that 


pri 
1 2452402 
a= k(a?4624-02—d? 02 f2) 
gt SS 
k=0 a,b,c,d,e,fEZ/pz 


i p-l ( 3 3 

at > S ka? > —kd? 

= z : z : 
PO \aez/pz eZ / pl 


In the previous sum, there is one obvious term: the one for k = 0, which 
gives us p®. Also, for each 1 < k < p~ 1 we have 


(z).(pew)- pwn pm 


a Fy deZ/pZ adeZ/pZ, Tyek/ pe 


by the change of variable a — d = z,a +d = y (which recovers a,d uniquely 
because 2 is invertible modulo p). On the other hand, for a fixed x 4 0, we 


have 
ye Yo no, 
yeL/ pL yeh / pe. 


as the map y -» key (mod p) is a bijection of Z/pZ. 
Doayer, [pt kev = p, which shows that 


[s ke . > ght? =p. 
acZf/ph deZ/ pe 


Sombining the previous paragraphs yields the answer to the problem, namely 
p+ (p—1)p?. oO 


Therefore, 


Proof. First look at a? ~d? = (a ~d)(a+ d) mod p. Since p is an odd prime, 
we can choose ¢ = a ~d and y = a +d arbitrarily and then solve uniquely for 
aand d via a = (x+y)/2,d = (y—x)/2. If x is nonzero, then since p is prime, 
zy takes on every value exactly once whereas if z = 0 then zy is always zero. 
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nonzero value p—1 times. Therefore it is described by the generating function 


Qp-1t(p- WX + (p— WX*+--- + (p— 1) XP I 
=p+(p—1(L+X4X74---4 XP) 


in Z[X] modulo X? ~ 1, Note that modulo X? — 1, the polynomial 
14X4X?74.-.4 xP7} 
has the feature that 
f(X)\G 4X 4X7 4---4 XP) = ft X4 XP 4-0-4 xP) 


for any polynomial f(X) (as X — 1 divides f(X)— f(1)). Thus the generating 
function for the values taken on by (a? — d?) + (6? — e”) + (c? — f*) is 


(pt (p-DQEX4X24---4-xP HP 
= p> (p> ~ p?)(L +X + X?4 +--+ XP74). 
From this we read off the number of 6-tuples for which a?+6?+c? = d?+e?+ f? 
(mod p) as p? + p? ~ p*. o 


We continue with a very nice problem, in which one uses a mixture of 
the previous techniques, algebraic manipulations and a rather tricky number- 
theoretic argument. 


=1 _ 

10. Let p be an odd prime. Prove that the 2°" numbers £14:2+---+2 
represent each nonzero residue class mod p the same number of times. 
Compute this number. 


R. L. McFarland, AMM 6457 


pit 
Proof. Let z= e? and write 


> 


ee{-1)} 


<1 
eq bdegde + Pee pt ; a 
z PEP = ag + az +--+ + apeaz? 
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for some integers a;. Since z* only depends on 2 (mod p), it is clear that 
a; is exactly the number of ways residue i is represented by the numbers 
+142 ee, Thus, the problem asks us to prove that a, == ag = +--+: = Apne) 
and to find this common value. 

The point is that $ has a nice closed expression, since it obviously factors 


as 
at 
S= [][(i+27). 
j=l 
On the other hand, 
pol pel a 
T@+2%=s- Tf +) =8 [erro yes 
j=l jth gel 
and 
pal pol 
(1+ 2%) =[[Q+2)=1 
jal jul 


The last relation uses the fact that © -+ 2x is a bijection of the nonzero 
remainders modulo p (as p is odd) and that 


pei 
[[a+2) =1, 
j=l 


which follows from 


p-l 
its ~A)= XP a} 
j=l 

by taking X = ~1. 

The previous computation shows that S? = 1, so that S = +1 is definitely 
an integer. But then then relation ag —- S + a@jz +++ + Api?! == 0 implies 
that ap ~ S = ay = ++: = dp... In particular, aj = --- = G@p—1 and the first 
part of problem is solved. 
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On the other hand, we clearly have 
ag + a; toe payy = 27F, 
which combined with ag ~ S = a, = +++ = Q@—, and with S$ = +1 shows that 
S=2°F (mod p) =(-1)"*" (mod p), 


pnd , Z 
so that S$ = (—1)"=" and the value we are looking for is 


2?s' — (1) ee 


Qy = Og + Ay 
f p 
We used here a standard result in quadratic residues, saying that Legendre’s 
ee! 
symbol (2) = (-1)7*", Oo 


It is rather difficult to approach the following problem directly with the 


methods of this chapter. However, a small observation reduces the problem 
to a more familiar one. 


11. a) Let » be an odd integer. Find the number of sequences 
(@0,@1,-..,;@n) such that a; € {1,2,...,} for all i, a, = ag and 
a, ~ a1 #4 (mod n) for all i= 1,2,...,n. 


b) Let n be an odd prime. Find the number of sequences 
(a0,41,...,@n) such that a; € {1,2,.. .,7} for all 2, a, = ag and 
a; ~ Gy, Ft, 2t (mod n) for all i= 1,2,...,n. 


Reid Barton, USA TST 2004 
Proof. a) Call a sequence as in the problem admissible. Considering 
b; = a;~ aj. ~ 7% (mod n) 
for 1 <i <n, the condition becomes 6; 4 0 (mod n). Note that 


by + bg tee thn S-(L+24---4+n) =O (mod n), 


| 
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since n is odd. But conversely, if we have a sequence 6), b2,...,bn of elements 
of Z/nZ satisfying 6; # 0 (mod n) and b; +b2+---+b, = 0 (mod n), then we 
can find precisely n admissible sequences giving rise to 6;,b2,...,5,. Indeed, 
choose any ag € {1,2,...,n} and take for a, the remainder modulo n in 
{1,2,...,2} of the number a9 +14+2+4--++h +0) + b2+----+ by. Thus, 
if f(m) is the number of sequences (b1, b2,..-,5n) € (Z/nZ — {0})” such that 
by + bg +++) +b, = 0 (mod n), then the desired answer is nf(n). 


2x 
Now, we evaluate f(n) in the standard way: let z =e", so that 


nt 
f(n) = - > S ZMortbat-+n) 


br y-rsbn €Z/nZ~{0} k=O 


t J ai iy 
SS kb 
=— z 
Peo Veni 
Since 
n=l 
bea =n-1 
b=1 
for k = 0 and it equals pe sae ~l=~I fori<k<n-—1, we deduce that 
n—1)"—(n-1 
fin =O en) 


nr 


so the answer to part (a) is (vn ~ 1)” ~ (n~ 1). 

b) The same argument as in the first paragraph of (a) shows that it is 
enough to count the number g(n) of sequences (61, b2,...,5n) with b; € Z/nZ, 
5; # 0,4 (mod nz) and 6; + bo+---+b, =0 (mod n). The desired answer will 
be ng(n). 
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We compute in the same way 


g(n) = - > So bite -+bn} 
* ean k= 
hk 5 Sy thn 
nos 0 by B04 
p= nm 


3 eg 


k= i=l \be0,i (mod n) 
For k = @ the corresponding product equals trivially (n — 2)". (n — 1) (the 
factor n ~ 1 comes from the sum associated to i = n). Now fix 1<k<n—1,. 


For all 1 <i <n we have 


hb pe hep git “Dh gg ght) = (1 + 2), 
b40.4 (mod n) 


For i = n the corresponding sum is ~1. Thus 


Ba nai ' 
Ti( © **)=-Tfaes 
i=1 \b#0,i (mod rn) i=I 


ne since n is a prime and 1 < - < 7, we have saa n)= =1, so bas nunobers 


Thus 


om 
JJa+)= J] G@eo=1 
i=t atom tug] 


the last equality being a consequence of the equality 


Il «-»- te i 


uatuAl 
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and of the fact that n is odd. We deduce that. 


n 


M( x l= 


t=1 \b40¢ (mod n) 
for all 1 < k <n — 1 and so the answer to the problem is 


Gene Ue ie ==) ; 


The following problem is hard, even though the first steps are rather clear. 
The difficulty lies in the algebraic technicalities. 


12. Let p > 3 be a prime number. 


If X is a nonempty subset of {0,1,...,p—1}, let f(X) be the number 
of sequences (@1,42,...,@p~1} such that a; € X for all j and p divides 
, i jaj. Prove that f({0,1,3}) > f({0,1,2}), with equality if and 
only if p = 5. 


IMO 1999 Shortlist 


Proof. Let us first find a formula for f(X). Letting z = er, the usual 
argument yields 


& iS Sy eHlerttartHo-tap-1) 


ke=€) ay Gp 1 EX 
(=) nies ( stot . 


In particular, 


pei pol 
£({0,1,2}) = pe [[a+ 2#4+2%) 


#=0 \j=i 
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Since {21 <j < p~1} = {24|1 <j <p—1} for all i 40 (mod p) and 
since p > 3, it follows that 


F({0, 1,2}) =; ( +(p- 


We also have 


y Pat feat ; S 
£{0,1,3)) = 5 [[ +67 +6) }- 


i=? \Gud 


So, if a, 8,7 are the roots of <?+2+1, we have (using again that for alli #0 
(mod p) the numbers (27); are a permutation of the numbers (z?);) 


Pp ay . . 
£({0,1,3}) = ; (> +(p- DTC - le - aye - ») 
jel 
_ 14 (p— Dap 
a , 
where “A fe 
a, = DG NO*= 0) 
“ (@-DNE-YO-}) 
It is thus enough to prove that a, > 1, with equality precisely for p = 5. 
However, this sequence satisfies a linear recursive relation with characteristic 
polynomial 


(X — a)(X — BX — y)(X a BYX ~ B-y)(X ~a-y(X—a- By) 
= (X41) 4 X 4 IX ~ a+ BX ~ 8 -7)(X - +9). 


We can easily compute that 


(X —a- A(X —B-y\(X—a-y) = (x+2) (x+ ) (x+5) 


= X3— X71 
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and so the characteristic polynomial of the sequence (anja is 


(a +a + 1)(08 — 2? — 1)(@ +1) = 27 ~ 223 ~ 20? — 22 — 1, 
Thus, there exists a constant C’ such that for all n we have 
int? = Udnt3 + On42 + Qn41) +n + CO 


It is not difficult to compute that the first terms of the sequence (ap), are 
0,1,1,3,1,1,3,8,... and that the sequence is increasing after the sixth term 
{and C = —2). The result follows easily. 


Remark 7.1. Except for the technical combinatorial part, which, as we have 
seen, is quite standard, the difficult point of the problem is establishing the 
inequality ap 2 1, with equality precisely for p = 5. Proving that a, > 1 is 
however rather easy, without the computation of the characteristic polynomial. 
Indeed, note that a, is a symmetric polynomial with integer coefficients in the 
roots of X? + X + 1, so it is an integer. On the other hand, the polynomial 
function 2? + 2 +1 is increasing on the whole real line, so among a, 8,y 
precisely one is real, say a. Moreover, it is easy to check that ~1 < a < 0. 
Thus 4:2 > 0. Also, 


a 
>0, 


(1 BP) = 7?) l ~ BP 
G-Ai=%) “lis 


so that ap > 0. It follows that @ > 1. However, we know no easy proof of the 
fact that ay > 1 for p > 5. 


7.3 Miscellaneous problems 


13. Let ax, by, cx be integers, k = 1,2,...,n and let f(x) be the number of 
ordered triples (A, B,C) of disjoint subsets (not necessarily nonempty) 
of the set S' = {1,2,...,n} whose union is S$ and for which 


> a;+ 2 b+ e ¢ =a (mod 3). 


ieS\A ieS\B so ieS\C 
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Suppose that f(0) = f(1) = f(2). Prove that there exists i ¢ S such 
that 3 | a; + & + ¢. 
Gabriel Dospinescu 


Proof. Observe that 


nm 


F(X) = [] (ae 4 xh + xt) 
isl 
& Ss KX dessa Ht Lies/a it Liesso% 
{A,B,C} 


Thus, the equality f(0) = f(1) = f(2) = 3°"! becomes 
F(X) =3"71114X4X") (mod X*~1) 


Since 1+ X + X?{X9 — 1 this means in particular that 1+ X + X?|F(X). 
So there exists ¢ such that 2% +h 4 2bta 4. 2%+4 — 0, where z is a primitive 
third root of unity, This means that {a; + bi, b; + c,q@ + ai} is equivalent 
to a permutation of {0,1,2} (mod 3) and so 8|a; + 5; + cq. The conclusion 
follows. O 


Readers who are not familiar with basic linear algebra will have a rather 
hard time trying to solve the following problem. 


14. Let p be an odd prime and n > 2. For a permutation o of the set 
{1,2,...,n} define 


S(a) = o(1) + 20(2) + +++ 4+ no(n). 


Let A; be the set of even permutations o such that S(a) = j (mod p) and 
let B,; be the set of odd permutations o for which S(a) = j (mod p). 
Prove that n > p if and only if Aj and Bj have the same number of 
elements for all 7. 


Gabriel Dospinescu 
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Proof. Consider the matrix A == (X"), whose determinant is given by 


det A == > XS) _ St x8), 
o even o odd 


On the other hand, we also have a simple closed form for det A, given by 
Vandermonde’s formula: . 


det A = Il (x7 — x’). 
isi<jsn 


" aa first formula and the definition of A;, B; imply the following equality 


pl 
S74; — By) Xi = Il (X?— X*) (mod X? — 1), 
j=0 


isi<jsgn 


Therefore, the problem comes down to showing that 


isi<jsn 


This is actually very easy. Indeed, if n > p then clearly 


XP — 4xPt] _ x] Il (xi ~ x#), 
isi<jcn 


are done. - 
a 


The following problem is very challenging. Here, it is very hard to find 
the correct approach, especially because the problem suggests number theory. 


15. Is there a positive integer k such that p = 6k +1 is a prime and CG) =1 
(mod p)? . 


USA TST 2010 
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Proof. The answer is negative, but the proof is far from being obvious. Sup- 
pose that p = 6k +1 satisfies (5*) = 1 (mod p). Let g be a primitive root mod 
pand let z = g®, so that z has order k mod p. Consider.the sum 


kw} k-1 3k 3k | 3k 3k kel 7 
amYart= 2 (2G) )-L Ge" 
i=O a=0 \j=0 j=0 i=0 

hee 


Since z has order k mod p, the sum pay 2? is 0 unless j is a multiple of k, 
which happens if and only if 7 = 0,4, 2k, 3k. Hence 


s=((o)+(i) +(e) * (a) # 


Since (5*) = 1 (mod p), we deduce that S = 4k (mod p). On the other hand, 
(1+ 2*)* = (14 ye =~-1,0,1 (mod p). 


It is now immediate to check that we cannot have k remainders mod p, each 
of them ~—1,0 or 1, adding up to 4k modulo p. This contradiction shows that 
no such k can be found and solves the problem. Oo 


We present two difficult solutions for the following beautiful, but very 
challenging problem. 


¥ 16. Let p be an odd prime and let a, b,c, d be integers not divisible by p such 


{=} {>} {=} {=I 

— + =2 

Pp Pp P Pp 

for all integers r not divisible by p (here {-} is the fractional part). Prove 


that at least two of the numbers a+ b,a+¢,a¢+d,b+¢,b+d,c+d are 
divisible by p. 


Kiran Kedlaya, USAMO 1999 
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Proof. Let r(x) € {0,1,...,p — 1} be the remainder of z mod p and observe 
that r(x) = p-{ Sh. Thus the hypothesis can also be written 


r(an) + r(bn) + r(cn) + r(dn) = 2p 


for any gced(n,p) = 1. Let z be a primitive root of unity of order p and let 
a’,U/,¢,d be the inverses of a,b, ¢,d modulo p. For any m not a multiple of 
p we have a2! — 27. sol we have Sor(an)ere" = 22". Pix a number 
m relatively prime to p and let. n run over a system of representatives of the 
nonzero classes mod p. Then r(mn) runs over 1,2,...,p—1 and so does r(an), 
as a and m are not multiples of p. Hence if we take the sum over n of the 
previous relation, we obtain 


‘p=1 p-1 
ey So jem’ a 2p So 2 = —2p. 
fal = 


It is easy to check the identity 


L$2X fe txt = RAM S (M+ YX" 4] 


which easily implies that 


and similarly for 6,c,d. Thus, we can write 


os i = 
ae Tawe = 


and this for all m relatively prime to p. Clearing denominators and canceling 
equal terms yields (after a tedious computation) an equivalent equality 


+ 3 ylatbltelim x 28M 4 pola titelta!)mn, 


Lan unspecified sums are over a,b, ¢, d, 
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which continues to hold (trivially) when m is a multiple of p. 
Finally, we add these relations for 0 <m <p— 1. Note that 


prt 


’ 
znd!’ 0 


m=0 
and similarly for b’,c’,d’ and that 


pol 
hm >0 
a 


ma 


for any N (simply because the corresponding sum equals p when 2“ = 1 and 
0 otherwise). We deduce that 


pel 
9 ss; Mal Htd+d) > on 


m=) 


which implies that a’ + 6’ +c’ +d’ is a multiple of p (otherwise the left-hand 
side would be @). Therefore, by the previous key equality, we also have 


ah , 
5 2 eee Yeuge me 


for any m relatively prime to p. By multiplying the previous relation by 2~"*’, 
we get 


[agMY—a') 4 2(e-aym 4 pm{d'—a') 


= 7 2a'm ae go late ym x gate jm as gata ye 


and a similar argument (add the equations and observe that the left-hand side 
is at least p) shows that at least one of a’ + Y,a’+c',a' +d! is a multiple of 
p. Say pla’ + 8’, then also pic’ + d’ (as pla’ +b! + c +’) and so pla + 6 and 
ple + d, finishing the proof of this hard problem. 
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Proof. This second proof uses Lagrange’s interpolation theorem, for which we 
refer the reader to chapter 11. As in the previous solution, 


r(x) € {0,1,...,p-1} 
is the remainder of ¢ modulo p. Define 


_ 2r{x) ~ r(2x) 

c= igi re 
so that Q(z) = if 0 < r(a) < (p—1)/2 and Q(z) = 1 if (p~1)/2 < r(z) <p. 
Call (a, 5, ¢,d) good if it satisfies the relation in the problem, which can also 
be written as r(ka) + r(kb) +r(ke) +r(kd) = Qp for all 1 < k <p. Note that if 
(a,b, c,d) is good, then so is (ka, kb, ke, kd) for any k which is not a multiple 
of p. Also, note that if (a, 6,c,d) is good, then 


Q(a) + Q(6) + Qe) + Q(d) = 2. 


Combining these two observations, we deduce that 


Q(ha) + Q(kb) + Q(ke) + Q(kd) = 2 


Q(z) 


forall l<k <p. 

By Lagrange's interpolation theorem, there exists a polynomial P(X) of 
degree at most p — 2 such that P(x) = Q(x) (mod p) for all z % 0 (mod p). 
Note that if R(X) = P(X + 1) — P(X), then R(x) = 0 (mod p) when 


p-3 p+l 
"2s aes 
and R((p — 1)/2) % 0 (mod p), so deg R = p~ 3 and degP = p~2. On 
the other hand, the polynomial S(X) = P(Xa) + P(Xb) + P(Xe) + P(Xd) 
is congruent to 2 mod p for p ~ 1 values of the variable. Since deg S < p ~- 2, 
S ~ 2 must be the zero polynomial mod p. Imposing that the coefficient of 
XP? vanishes in S, we obtain the key relation 


Ea Oe p~2 


aP~? 4 ppr? 4 cP? 4 P-2 = (mod p), 
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which can be also written (by Fermat's little theorem) 
Ay CA pin 
atgt at gre 
Finally, since r(a) + r(b) + r(c) + r{d) = 2p, we havea +b+e+d=Qin 
F,. Combining the two relations yields 
ye 1 


Gb se" 


which readily becomes (after clearing denominators) (a + b)(b-+ e)(e-+ a) = 0. 
By symmetry, we may assume that @+6 = 0 in F,. But then c+ d= 0 also 
and we are done. O 


We end this chapter with a very deep result, whose proof is however 
elementary (and follows {52]). Tt improves previous results of Brown and 
Buhler [12], Frankl, Graham and Réd] [33] and finally Ruzsa [52]. For more 
details on some arguments concerning orthogonality relations and properties 
of characters, we refer the reader to addendum 7.A. 


17. Let p be a prime and let S C (Z/3Z)" be a subset containing no line 
of the affine space (Z/3Z)*. Prove that S$ has at most 22° elements. 


However, prove that we can find such a set with at least 3°°~* elements. 


Meshulam-Roth theorem 


Proof. Let f(d) be the largest cardinality of a set SC Fg (we write Fs for 
Z/82) containing no line. Note that three distinct points of F? adding up to 
the zero vector form a line and conversely, the surn of the points of a line is 
the zero vector. So, f{d) is also the largest cardinality of a set S containing 
no three elements that add up to 0. Since F? and F = Fe (the field with 3¢ 
elements) are isomorphic as additive groups, we may work from now on with 
subsets S of F. 

In particular, if S is such a set, the orthogonality relations for characters 
of abelian groups (theorem 7.A.5) yield 


3 . 
a (Sw) =a > Y= x(si + s2 +53) = [81, 


ses $19283ES X 
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the sum being taken over all characters y of F' = F. '3¢- The whole point is to be 
able to estimate the previous quantity for each character x: Let g{d) = Ld) 


8 
and take a subset 9 as in the theorem, but with |S} = f(d) = 3¢9(d). Note 
that 


Sees — o(d- 1))x(2) = J x(s) 


cer ses 


if y is not trivial and it equals Doses X(8) ~ 34g(d — 1) otherwise (again by 
orthogonality of characters). We deduce that 


2 
IS} = 9(d— ISP + a (= ro) (Sos ~ g{d- xt) 
x 


eS rer 


SS x(s) 


séeS 


j2 
L | 
2 ola 1S? — 
x 


Sees ~ o(d- xt) 


rer 


Here comes the crucial estimate: 


Lemma 7.2. For all characters x we have 


Dltses ~ g(d~1))x(a)| < 3494-1) —|s), 


wer 


Proof. If x is nontrivial, let Hg be its kernel, while if x is trivial, let Hp be any 
hyperplane of F (where F is seen as F3-vector space of dimension d), Take 
41, Hz two affine hyperplanes parallel to Hy so that the H;’s form a partition 
of F. Note that by definition y is constant on each Hj, say x(z) = % for 
x © H; (where of course |z;| = 1). ‘Then 


2 
Sa aemiiwe)| 


eer 


a(|S0 Hil — fd ~ ») 
a 
< isn ml - fa-1). 
ty 


is 


} 
E 
[ 
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But clearly S  H; does not contain three elements adding up to 0 and since 
H, is (d — 1)-dimensional, we deduce that |S H,| < f(d.— 1) (by definition 
of f(d- 1)). Therefore, the previous estimate yields 


S7Gaes ~ 9(4~ 1)x(@) 


eek 


2 
< 34g(d—1)- SoS Hil 
=0 


ix! 


= 8%9(d ~ 1) ~{S! 


which is precisely what we wanted. o 


Coming back to the proof and using the lemma and Plancherel’s identity 
(theorem 7.A.5) 


x 
we deduce the estimate (do not forget that S was chosen such that [S| = 


3%g(d) = F(d)) 


2 
= 34\\, 


x69) 


seS 


1S] 2 g(d— ISP ~ |S1(8%9(a - 1) ~ |S), 


which implies that 


which is precisely what we wanted. 
Finally, let us show that f(3d) > 9¢, which will trivially imply the re- 
maining part of the theorem. Let a = 3¢ 4 1 and consider 


S = {(a,a7)]a € Foc} C Foe x Foe. 


This has 9% elements and we claim that it does not contain three elements 
adding up to 0. Note that a” € Fya if a € Fy, because (a2) = Llifa 40. 
If (aj, a7) add up to 0, we obtain a, + a2 + a3 = 0 and ef + a3 + a} = 0. But 
then 

at + a% + (ay + a9)" = 0. 
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On the other hand, since we work in characteristic 3, we have 


(a, + az)* = (a, + a2)(a1 + an)™ 
= (ay + a2)(az" + a3") 
= af +a} tajah ht agay*. 


We deduce that 


(a, — @2)° = (ay - a2)(az"} oe as) = 0, 


a contradiction. 


7.4 Notes 


We thank the following people for providing solutions to the problems 
discussed in this chapter: Mitchell Lee (problems 1, 2, 10, 15), Richard Stong 
(problems 6, 9), Qiaochu Yuan (problems 5, 6, 8), Victor Wang (problem 16), 
Alex Zhu (problems 15, 16), Gjergji Zaimi (problems 12, 13, 14). : 


Addendum 7.A_ Finite Fourier Analysis 


The identity 


1 A ainkions) 
a a = Lgsb (mod n) 
k=O 


(for all integers a, and all positive integers n) is at the origin of a lot of 
beautiful mathematical results, as we could see in chapter 7, but it is just 
a special case of a broader theory. In the first part of this addendum we 
will see a vasi generalization of this relation, through Fourier analysis on 
finite abelian groups. Though rather elementary, these results are extremely 
useful in number theory and combinatorics. On the other hand, they are the 
very first component of a much broader picture, that of harmonic analysis. 
Things are much easier for finite abelian groups, since one avoids quite a lot of 
technicalities that appear when dealing with harmonic analysis on other groups 
(such as R, the unit circle in C or, much harder, non-abelian and non-compact 
topological groups). For instance, all convergence issues are automatic (as all 
sums we deal with are finite}, while the integration theory has no subtlety 
(on the other hand, if one wants to do harmonic analysis on R, one has to 
be very careful with these issues and one has to develop a rather powerful 
integration theory first). Also, the fact that the group is abelian simplifies the 
problem considerably, basically because all irreducible complex representations 
of such a group are one-dimensional. We will also recall the main features of 
Fourier analysis on finite general groups, without proving anything, since this 
is already the content of a course in representation theory. Next, we deal with 
a classical, but amazing application of these ideas, namely Dirichlet’s theorem. 
This allows us to say a few words about L-fanctions of Dirichlet characters. 
This is again just the tip of a huge iceberg, which is far from being understood, 
even though progress is constantly being made. 


7.4.1 Dual group 


From now on, let G be a finite abelian group with n elements. We want 
to define the Fourier transform of a function f : G— C. Recall that if f is an 
integrable function on R, with complex values, its Fourier transforrn is defined 
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by - 
Re) = | syevay. 


The presence of the characters y > ¢?'**Y of R will be our guide to Fourier 
Pp ¥ 8 
analysis on abelian groups. 


Remark 7.A.1. When dealing with abelian groups (which will always be the 
case for us), one also denotes by + the internal operation of G. This is quite 
intuitive when dealing with groups such as Z/nZ, but definitely not suitable for 
groups such as (Z/nZ)*. Our convention will be the following: when dealing 
with an abstract abelian group G, we will use multiplicative notation for the 
internal operation of the group, while in concrete examples we will choose the 
most intuitive notation, depending on the situation. Hopefully, this will not 
create too much confusion... 

A character of G is a morphism of groups x : G -» C*. So, according to 
whether we use additive or multiplicative notation for the internal operation 
of G, a character satisfies x(x + y) = x(x) - x(y) for all a,y € G (respectively 
x{zy) = x(#)- x(y)). The character is called trivial if y(g) = 1 for all g € G. 
Let G be the set of all characters of G. It becomes a group with respect to the 
obvious multiplication (x1-x2)(9) = x1(g)x2(g) and it is called the dual group 
of G. Note that for all y € G and g € G we have x(g)" = x(g") = 1, because 
g” = 1 by Lagrange’s theorem. In particular, |x(g)| = 1 and y~!(g) = x(a) 
for all g € G and y € G. The idea of harmonic analysis is that all information 
about the space of C-valued functions on G is encoded in G. 


Example 7.4.2. Taken > 2and G = Z/nZ. What is the dual group of G? We 
saw that (1) is an nth root of unity. And clearly x is uniquely determined by 
x(1), as G is generated by 1. Conversely, if z is an n-th root of unity, 2 > 2* 
defines a character of G (by 2” we mean z* for any lifting a of x; this does 
not depend on the choice of a, as 2” = 1). We deduce that G is isomorphic 
to Z/nZ, even though this isomorphism depends on the choice of a primitive 
nth root of 1, so it is not really canonical. 


It is an easy exercise to check that passing to duals is compatible with 
direct products (i.e. the dual of Gx H is Gx H). Since any finite abelian group 
is a direct product of cyclic groups (this is a nontrivial, but. absolutely classical 
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result), it follows from this remark and the previous example that for any finite 
abelian group G, its dual is isomorphic to G. In particular, G and its dual have 
the same number of elements, which is really not obvious at first sight. We also 
deduce from this observation and example 7.A.2 that if ¢ ¢ G—{1}, then there 
exists y € G such that x(x) # 1 (if you think about this for a moment, you will 
realize that it is nontrivial). Thus, the map g +> (x x(g)) is an injective 
homomorphism G -+ G, realizing G as subgroup of G. Since |G] = |G], the 
previous injection has to be an isomorphism. So G is canonically isomorphic 
to its double dual. Let us now glorify what we have just proved: 


Theorem 7.4.3. If G is a finite abelian group, then G is an abelian group 
isomorphic to G and G is canonically isomorphic te G. 


Let F(G,C) be the C-vector space of all maps f : G + C. It is a C-vector 
space of dimension |G], since the map F(G,C) + CI¢l sending f to (f(9))gec 
is obviously a C-Hnear isomorphism. If f,g € F(G,C), let 


h9) = A fe) aC. 


It is easy to check that this is an inner product on the C-vector space F(G,C). 
We are now ready to prove the main theorems of Fourier analysis on G. 


Theorem 7.4.4. The elements of G form an orthonormal basis of F(G,C}. 


Proof. We split the proof in several steps. 

Step 1. We prove that (41, x2) = Lyex., Le. that eG is an orthonor- 
taal set. If yi = x2, everything follows from the fact that y(g) has magnitude 
i for any g and any character 7. Assume that x = et is not trivial. Then 


(xa. X2) = » So x2). 
“IG 
veG 
Let S = Seq x(a). Then for all g ¢ G we have 


9)S = 3° x(ge) = 7 x(e) = 


2éC ©éG 
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because « + ga is a permutation of G. Since x is not trivial, there is g such 
that y(g) # 1 and the previous identity yields S = 0 and so (x1, x2) = 

Step 2. We claim that for all ¢ € G ~— {1} we have Dyed X(z) = 0. The 
same argument as in the first step shows that it is enough to prove that there 
exists y € @ such that x(«) # 1. This is nontrivial, but has already been 
explained. 

Step 3. Since (x) xcG 18 an orthonormal set, it is linearly independent. 
But since it has the same cardinality as the dimension of the vector space 
F(G,C) (namely |G}, by theorem 7.4.3), basic theory of vector spaces shows 
that it has to be a basis of F(G,C). The result follows. Oo 


In practice, one really uses the following consequence of the previous 
theorem: 


Theorem 7.A.5. For any finite abelian group G,, the following relations hold: 
t) aaa relations) For all y,x1,x2 €G and g,héG 


a + So xa(a)xal@) = yrayes a ONO Ea as 


zeG xeG 


2) (Fourier inversion) For all f € F(G,C) we have f = Dyeall, xX. 
3) (Plancherel’s seas For all f € F(G,C) 


ia: OM ayP= SIMA xP. 


xed 


Proof. 1) bas already been proved while proving the previous theorem. For 
2), note that for any a € G we have, by the orthogonality relations 


(=u x): .) ()= qx D 10xw) 
x u 


x 


= GZ LIO LD xe = 1, 
uv xX 
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from which the result follows. Finally, using 2) and the previous theorem, we 
can write 


a LUO = U.N = LDU) Toad bare) = Tso. 0 


eG x1 x2 xed 
Remark 7.4.6. More generally, we have 


(£.9) = UG xd 


xed 


for all f,g € F(G,C), as can be deduced by an obvious adaptation of the proof 
of Plancherel’s identity. 
By analogy with the usual formula in classical Fourier analysis 


2 


5 iny, 
e F(ye dy, 


F(n) 


we write f(x) = (f,x) and call it the y-Fourier coefficient of f. 


7.4.2 A glimpse of the non-abelian case 


One may ask whether the above theory can be adapted to the case when 
the group G is still finite, but, no longer abelian. The answer is positive, but 
things are much more complicated than in the abelian case. In order to develop 
Fourier analysis on any finite group G, one has to consider all complex finite 
dimensional representations of G. By definition, such a representation consists 
of a finite dimensional C-vector space V on which the group G acts linearly, ie. 
for each element g € G one has an automorphism still denoted g of V such that 
gh = goh (in the left-hand side gh is the automorphism associated to gh € G, 
while in the right-hand side we have a composition of automorphisms). In more 
down to earth terms, a representation of G is a morphism? p : G > GL»(C) for 
some n > 1. Two representations pj. 2: G -> GL,(C) are called isomorphic 
if there exists P € Gln(C) such that p2(g) = Ppi(g)P7! for all g € G. 


*Recall that GLa(C) is the group of matrices A € M,,(C) with nonzero determinant. 
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If n = 1, we call p a character (this is compatible with the definition of a 
character given in the previous section). Such a representation V is called 
irreducible if no proper subspace of V is stable under all automorphisms g, 
with g € G. For instance, a character is an irreducible representation, as there 
is no proper subspace at all! Moreover, the only irreducible representations of 
a finite abelian group are the characters, so the new theory will be compatible 
with the theory developed in the previous section. To prove the previous 
assertion, one has to use a basic result from linear algebra, stating that any 
commutative family of endomorphisms of a finite dimensional vector space 
over an algebraically closed field has a common eigenvector. So, if G is an 
abelian group acting irreducibly on V, there is a common eigenvector v for all 
g © G. But then Cv is a nonzero subspace of V stable under all g © Gand so 
we must have Cv = V; hence V is a character. 
We have the following very basic theorem due to Maschke: 


Theorem 7.4.7. Any finite dimensional C-representation V of a finite group 
is a direct sum of irreducible representations, that is there exist sub-vector- 
spaces Vi,...,V_ of V stable under G, which are irreducible representations 
and such that V = @;Vj. 


__ Now, the role of the dual G of G in the abelian case is played by the set 
G of (isomorphism classes of) irreducible representations of G. It turns out 
that this is a finite set and by the previous discussion its definition agrees with 
the usual definition if G is abelian. The set of maps F(G,C) is naturally a C- 
algebra and as such it is isomorphic to the group algebra C[G]. By definition, 
the elements of the last object can be uniquely written S. geg agg With ag EC 
(so the elements of G form a C-basis of C{G]) and multiplication is defined by 


Sag eg . Soya ay. Se ands “Gg. 


HC ace g6G \hk=g 


Note that C[G] itself is a representation of G (each element of G acting as 
multiplication by g). The following theorem summarizes the properties of 


cia). 
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Theorem 7.4.8. 1) G is a finite set and C[G] is isomorphic as a repre- 
sentation of G to ®y.g(dimV)V, where nV = VOV@---@V (n times). 
In, particular, ; 
IG) = SO (dimvy’. 


veG 


2) The center of C{G] consists of all maps f : G + C such that f(g) = 
f(hgh"') for allg,h € G. Its dimension over C is equal to |G} and also 
to the number of conjugacy classes of G. 


In the abelian case, we defined an inner product on CiG] and we showed 
that the characters of G formed an orthonormal basis of C[G]. AI this can 
be done in the non-abelian case, though things are usually more difficult to 
prove. Namely, define for fy, fo € C[G} 


fis) = aD ADAG). 
ic ines 


Now, to any representation p : G > GL,,(C) one can associate its character, 
which is the element of C[G] defined by yp(g) = Tr(p(g)). The main theorem 
of Fourier theory on finite groups is then the following: 


Theorem 7.4.9. 1) If Vi, V2 are two representations of G such that xy, = 
XV as elements of C|G], then Vy and V2 are isomorphic (and conversely). 


2) OW yea is an orthonormal basis of the center of C[G] for the previously 
defined inner product. Moreover, a representation V is irreducible if and 


only if (xv. xv) = 1. 


3) If gh © G, then DLvee xv(g)xv(h) is equal to the cardinality of the 
centralizer of g in G if g,h are conjugate in G and it is equal to 0 
otherwise. 


’The conjugacy class of an element g € G is {hgh {fh € G}. 
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7.4.3 Some concrete examples 


Let us specialize the previously quite abstract {at first...) theory to a 
very concrete situation that appears frequently in number theory. Let N be 
an integer greater than I and let 


G = (Z/NZy" 


be the abelian group of invertible residue classes mod N. A character of G is 
called a Dirichlet character of modulus N or simply a Dirichlet character mod 
N. If x is such a character, we set 


X(T) = lgcatn,wy=r + x(7) 


for all integers n, obtaining in this way an N-periodic function. We will focus 
on a more restricted class of characters modulo N, namely the primitive ones. 
Let us recall what that means. If d divides N and if x¢ is a character mod 
d, then yu yields a character mod N simply by composing it with the natural 
map (Z/NZ)* - (Z/dZ)*. We say that a character mod N is primitive if it is 
not obtained in this way, for any proper divisor d of N and any xg. A more 
practical and useful criterion is the following: a character y mod N is primitive 
if and only if for any proper divisor d of N there exists n = 1 (mod d) such 
that ged(n,N) = 1 and y(n) # 1. We leave to the reader the easy task of 
checking this. 

Let us see what happens when we apply the abstract theory to this situa- 
tion. Let a be an integer prime to N and let f be the map f(n) = lpia (mod N)- 
It is naturally a map on_G and it is clear from the definitions that for all char- 
acters x of G we have f(x) = x(a). So, using Fourier’s inversion formula, we 
obtain 


1 pane 
laze (mod N) ™* oN) y x(a)x(n), 
the sum being taken over all Dirichlet characters mod N. This relation holds 


for all n prime to N and plays a crucial role in the proof of the famous 
Dirichlet’s theorem, to be discussed in the next section. 
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Gauss sums and Fourier coefficients of Dirichlet characters 


To any Dirichlet character x mod N we attached an N-periodic function 
on @, defined by x(m) = Igca(nwyei - X(v2). Any N-periodic function on Z 
induces a map on Z/NZ and we can look at its Fourier coefficients with respect 
to this finite abelian group. As we have already seen, the characters of Z/NZ 
are identified with Z/NZ (we identify @ and the character « 3 eur), Via 
this identification, the Fourier coefficients of a map f on Z /NZ will be denoted 


for) = 4 Vo swe**. 


The following result gives further information about these coefficients when f 
comes from a Dirichlet character: 


Proposition 7.A.10. Let y be a Dirichlet character mod N. 


1) For any a relatively prime to N we have 


2) If x is primitive, then X(a) = 0 whenever ged(a,N) > 1 and we have 


1 


Ral = Te 


for ged(a, N) = 1. 


Proof. 1) Since x(x) = 0 whenever gced(a,N) > 1, we have 


— BE * : 
where (= eS’. As 4 ~ az is a permutation of G, we have 


x@)Ra) = 5S xlaader* = FS xle)e* = RA) 


aeG eG 
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and the result follows from |y(a)} = 1. 
2) Write N = du and a = du with d > 1 and ged(u,v) = 1. Let 


— Siny 


a primitive root of unity of order v. Note that 


Xa) = Wy oe 
1 N-1 . 
= LL Xe 
3=0 


= est hvyg? 
= ¥ ( Yo xu+ -] ¢. 
Jj {mod v} \k (mod d} 


It is thus enough to prove that 


S= SD xi the) 


k& {mod @) 


vanishes for all 7. Now, since x is primitive, there exists n = 1 (mod v) 
such that ged(n,N) = 1 and y(n) # 1. It is easy to see that (nj + kv) 
(mod N))x (mod a) is simply a permutation of (j-+kv (mod N)), (mod @)- Thus 


x(n)Sj = So x(nj + kv) = SG + bv) = 8} 
k (mod d) k(mod d) 


and since y(n) # 1, we have S; = 0. This proves the first part of 2). 
Finally, using Plancherel’s identity (theorem 7.A.5) for Z/N@ and the 
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information we have already gathered, we deduce that 


2M) xP 
a{mod N) 

YS r@r 
a(mod N) 


SS k@xa? 


ged(a,N}=1 
= PINIIR(DP, 


from where we obtain {X(1)| = Tr Using again part 1), the result follows. O 


7.4.4 Some applications 


We have developed enough theory in the previous sections to be able to 
prove quite a few nontrivial results. ‘The first one is very elementary, but tricky 
and taken from {71}. The interested reader will find in loc.cit many beautiful 
applications of finite Fourier analysis. 


Proposition 7.A.11. Let A be a finite set of integers and let f : A Z/pZ 
be a map. Then for any positive integer k there exist at least we (2k)-tuples 
(a1,..., 2%) € A?* such that 


f(ay) + flag) +--+ + flax) = f(@ns1) + f(Gaez) +--+ f(@2x) (mod p). 
Proof. Let N(j) be the number of 2k-tuples (a1,..., a2) € A®® such that 
f(ar) + faa) +--+ + flan) = F(aegi) + f(anzg) +++ + Flare) + J (med p). 


Clearly 37725 1 N(j) = |Al?*. We will prove that N(0) > N(j) for all 7, which 
will be enough to eis the desired result. Next, note that by the orthogo- 
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nality relations we can write 


yea} 
NG) = S EST HM Flour) Floa4) 9) 


B18 QKEA =O 


1S _ amy Binbfla) 
Se ( 
Ps 0 BEA 
pel Qinbfla) 
= 15h al a 
P to aA 


But it is apparent in this last expression that N(7) < N(0), finishing the 
proof. a 


‘The method used to prove the following result is extremely useful in ad- 
ditive combinatorics. 


Theorem 7.A.12. (D.Hart, A Iosevich) 
Let q be a E preme power, d a positive integer and let AC Fe be a set with 


more than q “} elements. Then for any x € Fj one can find a,b € A such that 
a-b=« there: is the standard inner ONE) in FA rap 


Proof. The idea is to express the number of solutions of the equation a-b = x 
using the orthogonality relations, then analyze the error term. Let x bea 
nontrivial additive character of F, and let n(x) be the number of solutions of 
the equation «6 = x with a,b € A. The orthogonality relations yield 


=-y>+ 7 xtc (a-b-2)) = MER, 
aber? ceF, 


where 


2S Y tee b-=2)) 


VGA bed cer, 
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Now, using the Cauchy-Schwarz inequality and once again the orthogonality 
relations, we obtain 


<DIV wel b- ol 


aca |bEA c¢0 


<9 - s XS x (a+ (e4by — c2bg))x(a{e2 ~ €1)) 


aGkg bi 626A 01 ,c26F5 


oi a S- x(2(c2 — ¢1)) Ss x(a - (eyb; ~ e2b2)) 


P ioea excneks ace 
= gt Al Ss > x(@(C2 — €1) ler =caba- 
bi .boeA c1,c2€F} 
Using the orthogonality relations and the substitution s; = 3, $2 = ¢3, 
we can write 


Ss S: x(@(c2 ~ €1))Leydy cab. 
by ,b2€A c1,c2 EF} 
Ss Ds sib; =bs > x(xsa(1 ~ 81)) 


by ,b2GA 81 Fs s2eFs 
(9 1b, =b. - > s Isyby=b2 
by ,b2EA 61,b26A a1 #1 
<(q-1) SS lnsn 
by ,ba€A 
< gal. 
Combining this with the first formula for n(x), we deduce that n(x) > 0 
if LA] > qe, from where the result follows. OG 


Corollary 7.A.13. Let A Cc Fy be a subset with more than qth elements. 
Then for any x € Fj there exist ay,...,aq € A and by,...,b4 € A such that 
@ = Gb, + agbg + +++ + agbe. 
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Proof. Apply the previous theorem to the subset A x A x-+- x A of Fa. ia 


The following bound on character sums is a basic tool in analytic number 
theory. 


Theorem 7.A.14. (Polya- Vinogradov) Let x be a primitive character modulo 
N. Then for ail positive integers m,n we have 


> x(7)) < VN log N. 


msc 


Before giving the proof, let us mention that Schur proved that if y is a 
primitive character mod N, then 


_— 
max, Ss x(n) > = VN, 
M cM | Qn 
so the Polya- Vinogradov inequality is not far from being optimal. 


Proof. Using Fourier’s inversion formula and een 7.4.10, we can write 


Adding this over j, using the triangle inequality and proposition 7.4.10, we 
deduce that 


An easy panies argument shows that sin« 22 


this to « = 44 (respectively x = ava) ND) ifa <! 
deduce hae” 


eforO<a2< % ets 
Gepeiay a> 8) we 


| > x) < VN - 1 < VNlogN 


mSj<n a<N/2 


and the result follows. o 
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Here is a nice application of the previous theorem, taken from [59]. We 
refer the reader to that paper for many refinements and further discussion: 


Theorem 7.4.15. (Murty) Let p be « | prime and let q ‘be @ prime divisor of 


p—1. Let a be an integer such that a T= (mod p). Then there exists an 


integer x such that jal < BP ge and 27 =a (mod p). 


Tn [59}, Murty proves that the hypothesis that g is prime is not neces- 
sary and that we can strengthen the conclusion by asking that |[2| < cp®/?/g¢ 
for a suitable absolute constant ¢. This requires some stronger estimates on 


character sums. 


Proof. Consider a parameter p > T > 1 and look at the number of solutions 
of the congruence 2? = a (mod p) with x € [1,T]. Using the orthogonality 
relations, we write this as 


S= 2 Incza (mod p) 


=P YE Grey 


x (mod p) 


Ere HT LX) Yo xn). 


nit 


In the previous sum we wil inguish those characters x such that y? = 1 
from the others. Indeed, if x? = 1, then V7 ,<p x%(n) = T and x(a) = 1 (since 
the hypothesis on @ implies that a is a gth power in F): while if x? # 1, 
Polya-Vinogradov’s theorem gives 


S x()] < vBlogp. 


nT 


Thus, since there are precisely g characters x such that x? = 1, the previous 
remarks yield 


qr p- 
Gis See <Proi-4g 
: p-l p-l Vp logp < ./plogp. 
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Now, everything follows from the simple observation that if S # 0, then there 
exists 1 <n < T such that n? = @ (mod p), while if S = 0, then 


p< PiteP “1. Qo 


If p is a prime, let n2(p) be the smallest positive integer @ such that 
a is a quadratic non-residue mod p. It is an easy exercise to check that 
nig(p) < 1+ ,/p, but it is much more challenging to find nontrivial bounds for 
no{p). The next result gives such an upper bound, by combining the Polya- 
Vinogradov inequality with Mertens’ theorems. 


Theorem 7.4.16. (Vinogradov) If p is a sufficiently large prime number, 
4 ¢ 
then there exists 1 <a <p? log? p such that 


Proof. Let m = [,/plog? p] and suppose that 


) 


ai, 
for all 1 <a < X = |p2# log? p- Let N be the number of quadratic non- 
& 
residues among {1,2,...,m}-. By Polya-Vinogradov’s inequality 


|m— 2N| = JE :) 


soN > 4,/plog p. On the other hand, any quadratic non-residue mod p 
must have a prime factor greater than X. Thus 


< Vplogp, 


Wt. Fos mm 

— fj < 

gy ~ gVbloep< Ns ) . 
X<qom 


i 
H 
H 
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Mertens’ theorem (theorem 3.A.5 and the remark following it) yields 


m logm m_ ) 
= + we] 
Ss q Oe eX : o(aty) 


X<qsm 


Note that oer = O(,/p > log p) and 


ibe log m =e 3 logp + 2loglogp | o(; 1 ) 
log X aug logp + 2log log p X log p 
sire OE go ( 2 
2 8 avenge * / \Xlogp 
1 4(f/e— I 
wi _ Ale = 1) log EP 40 (52 J. 
2 logp log p 


Combining these estimates yields 


m log lo 
> ~ }Vilogp < Ave - lm. ner O( /plog p), 
which is not possible for p large enough. Oo 


7.4.5  Dirichlet’s theorem 


We will use the previous tools and a bit of complex analysis to give a 
proof of the famous 


Theorem 7.4.17. Let a and N be relatively prime positive integers. For 
s->1* we have 


1 1 1 
Eso aM 6 (==3) +0: 


psa {mod N) 


In particular, there are infinitely many primes p = a (mod N). 


| 
i 
} 
i 
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The proof of theorem 7.A.17 combines Fourier analysis on G = (Z/NZ)* 
and very subtle estimates of the L function of a Dirichlet character near 1. To 
start with, define the complex L-function of a Dirichlet character x mod N 
by (recall that x(n) = 0 for ged(n, N) > 1) 


Ls,x) = x) 
n>1 
As |x(n)| < 1 for all n, this series converges uniformly on compact subsets of 
Re(s) > 1, so L(s, x) defines a holomorphic function in this region. Moreover, 
a simple argument going back to Euler and using the unique factorization 
theorem and the fact that <4, a = Varo £” for |x| < 1 shows that for Re(s) > 1 


we have 1 
(s, Sea 
x)= Il;- x(p)p™ 3° 


We easily deduce from this that L(s, Ki does not, vanish if Re(s) > 1. Moreover, 
if we choose a branch of the complex logarithm, we can write 


log L(s, x) = — }Slog(1 ~ x(p)p~*) = S7 > xe 
Pp 


Pon>l 


Dirichlet’s key insight was to use the Fourier transform to express the con- 
dition that » = @ (mod N) in an analytic way. More precisely, multiplying 
the previous relation by x(a), summing over y and using the orthogonality 


relations, we obtain for s > 1 
= x > at 


n>i 
=a (mod Ny) 


= ; = +00), 


p=a(mod N) ” 


the last estimate being a consequence of the inequalities 


1 1 
a = we Ll Lagan” 


Po n>z 
Tmzal re N) 
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The crucial point is that log L(s, x) remains bounded as s > 1* precisely 
when x is nontrivial and that log Z(s, 1) (where 1 is the trivial character) is 
relatively easy to handle and yields the factor log stp If the second part is 
rather easy to prove, the first part is a very deep result, equivalent to the 
non-vanishing of L{s, x) at's = 1 whenever x is nontrivial. Dirichlet’s proof 
was very roundabout, but we have quite a few different ways to prove this 
nowadays. First, let us deal with the “easy” part, which will also be important 
in the proof of the hard part. 


Theorem 7.4.18. L(s,x) extends to a function on Re(s) > 0, which is 
holomorphic except possibly at s = 1. If x is nontrivial, this function is holo- 
morphic at s = 1, but if x is trivial, we have 


7 1 
aim, (s —1)L(s,1) = II (: - :) ; 
BIN 
Proof. The key ingredient is Abel summation. Suppose that x is nontrivial 
and note that for all n we have [y(1) + x(2)+---+x(n)| < N, which follows 
easily from the orthogonality relations (theorem 7.4.5). An easy computation 
shows that 
n> —(n+1)78 = sn) + O(n”), 


which is uniform for s in compact sets. Thus the series 
S72) + x(2) ++ + x(n) (n-8 = (n+1)"*) 
n>1 


converges uniformly on compact subsets of Re{s) > 0. Moreover, by Abel 
summation, the sum of the series is L(s,y) if Re(s) > 1, which yields the 
holomorphic extension of £(s, x} to Re(s) > 0. 

On the other hand, if x is trivial, the inclusion-exclusion principle yields 


Hex)=T[Q-5)-ce a=. 
ae I x): a) ete 2 


Since 


ntl 1 yyl-s _. gl-s 
| (n*-#°) d= + (at) -n™ ; 
n 


ns s—l 
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we get 


But for ¢ € [n,n + 1] we have 


{ st 
jF ~n-F| = Jf —sn* del < 
nm 


yielding the uniform convergence of the series 
: +1 
os = es 
(=> (n* ~t>*) dt 
api 


on ail compact subsets of Re(s) > 0. Thus g is holomorphic on Re(s) > 0 and 
since ¢(s) = <4 4 + g(s), the result follows. 


Taking into account the previous theorem and discussion, theorem 7.4.17 
is a consequence of the difficult 


Theorem 7.A.19. If x is nontrivial, then L(1, x) # 0, so log L(s, x) remains 
bounded for 5 ~~» 1*. 


Proof. We saw that for all s > 1 we have 


ai ba x(a) log L(s, x) = S ae am a = 


x {mod N) n>h 
orca (mod a 


so by taking @ = 1 we obtain 


ll L(s,x) 21, 


x (nod N) 


But by the previous theorem all £{s,x) with y nontrivial are holomorphic at 
= 1 and L(s,1) has a simple pole at s = 1. Combining this observation 
with the previous inequality, we deduce that there is at most one nontrivial 
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character x such that L(1,y) = 0 (otherwise the product of L{s,x) would 
vanish for s -+ 1*), Assume that x is such a character, then clearly 


E(1,X) = 0, 


so that we must have x = Y, that is y takes only values +1 and 0. Thus, it 
remains to prove that (1, x} 0 whenever y is a nontrivial real character. 
We will present Paul Monsky’s elementary proof from [54]. 

onsider the function 


fa) = Dx), 


n>} 


defined for x € {0,1) (the series is obviously absolutely convergent for such x). 
We claim that f is unbounded as x -> 1”. Indeed, we can write 


fe) = Vox) S24 =F YE x@ | 2”, 


nol gzh n2i \adln 


all manipulations of the series being justified by the absolute convergence. Let 
nx ™ Dai, x(d). If p divides N, then. cys = 1 for all & > 1, as x(p) = 0. Also, it 
is easy to see that Cnn = Cmen for ged(m,n) = 1. An.immediate computation 
shows that cx > 0 for all primes / and all k and using the previous observation 
we deduce that c, 2 0 for all n. As infinitely many c,,’s are equal to 1 and 
all cp are nonnegative, 5°, ¢,2"' is not bounded as x -+ 17 and the claim is 
proved. 

Now, assuming that L(1,x)} = 0, we will prove that f is bounded as 
« — 1°, which will finish the proof. If 


i 
OS =a) 


then the hypothesis L(1, x) = 0 reduces the boundedness of f(x) as z > 1 to 
the boundedness of 57, bn()x(n) as ¢ + 1. The miracle is that the sequence 


A, Finite Fourier Analysis 331 


(bn(#))n is nondecreasing, as one easily checks using the AM-GM inequality 
that 


bn (x) — Bazi (x) 


i ~ a x" 
1 n(nt+1) (tet tar I(i+ep pan) 


£ 


is nonnegative. Next, using Abel’s summation formula, the monotonicity of 
the b,(x) and the fact that | 7", y(i)| are bounded, it is easy to see that f 
is bounded. But this contradicts the result of the previous paragraph, ending 
the proof of Dirichlet’s theorem. 


Once we know that L({1,x) 4 0 for all nontrivial characters xX, it is a 
natural question to ask how far it is from being zero. If we look carefully at 
the proof of the previous theorem, we see that it gives L(1,x) > 0 for any 
nontrivial real Dirichlet character. The following deep theorem gives much 
more. The proof is much more involved: 


Theorem 7.A.20. (Siegel) For any ¢ > 0 there exists e{e) > 0 such that 
LQ,x) > ae) for any real primitive Dirichlet character modulo N. 


7.4.6 A useful consequence 


The purpose of this section is to give an analogue of Mertens’ theorems 
for primes in arithmetic progressions. This uses the proof of Dirichlet’s the- 
orem and the standard technique of Abel summation. Recall that A is Von 
Mangoldt’s function, defined by A(n) = logp if there is a prime p and k > 1 
such that n = p*, and A(n) = 0 otherwise. Its usefulness in number theory 
comes from the relation Dodin A(d) = logn, which will be heavily used in the 
proof of the following result. 


Theorem 7.A.21. Let a,N be relatively prime positive integers. Then 


loge. Wee 
oye tay OO) 


PSE 
p=a(mod N) 
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Proof. Since 
log p 
ae 
tap P 
we have 
A 
See 3: Ae) + O(1). 
PSe nse 
psa (mod N) nza (mod oe 


Using the orthogonality relations, we can write 


A 1 aa (nr) 9 
Pg DRL ae 
nSatnod Ny x 7 


so everything comes down to the study of 
nr 
F(x,2) = a. 
nSr 
If x is trivial, Mertens’ theorem 3.A.4 yields 


F(i,2) = Wee 5p MEP + 1) log a + O(1). 


PeSe 
ged(p.N)=1 


Suppose now that x is nontrivial. We will prove that F(,a) is bounded as 
x ++ 00, which will finish the proof of the theorem. Exploiting the relation 
Duan A(d) = logn, let us consider the expression 


Since ¢ -> ge is decreasing for « > 3 and since the partial sums of the 
sequence (x(n)}), are bounded, Abel summation shows that E(x) = O(1). On 
the other hand 


E(@)= S> Oe) (a) = J = > AS ey 3 ae 


9 dy 
ddg sa dcx daSa/dy 
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Another Abel summation (using the fact that the partial sums Wher x(k) are 
bounded) shows that 


we = L(1y )- pw LA, y+o(t). 


Ga jor 


We deduce that 


1 
E(x) = F(x, x)L(1,x) +O (: Ea) : 
£ 
dca 
Since by theorem 7.A.19 we have L(1, x) #0, in order to prove that 
F(x, 2) = O(1) 


it is enough to prove that 


But 


So AC (d) = s logp < Dep? a Sat ss loga a(x) = 


dga prise pS« 


using theorem 3.A.2. The result follows, 


7.4.7 An “elementary proof” of Dirichlet’s theorem 


The proof of Dirichlet’s theorem that we presented used rather heavi Ly 
basic properties of holomorphic functions, but it turns out that the ideas used 
in the proof of theorem 7.4.21 may be combined with careful estimates tc 
obtain an entirely “elementary” proof of Dirichlet’s theorem, ie. completely 
avoiding the use of complex analysis. We will sketch such a proof in this 
section, but we must emphasize that the complex analytic approach is much 
more powerful and conceptual. 
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We will actually give an elementary proof of theorem 7.4.21, which ob- 
viously implies Dirichlet’s theorem. We will use the notations of arguments 
of the proof of theorem 7.A,21. The only “non-elementary” result used in the 
proof of this theorem is the fact that L(1,y) 4 0 for all nontrivial characters 
x. Monsky’s proof (see the proof of theorem 7.4.19) deals with the case when 
x is real in an elementary (but very tricky) way, so it remains to deal with the 
case when y is not real. Let k be the number of nontrivial characters y for 
which L(1,x) = 0 and assume that & > 1. Then Monsky’s result implies that 
k > 2 (since if L(1, x) = 0 then L(1,X) = 0). Either a direct computation or 
a simple application of Mébius’ inversion formula yields the identity 


DL Md) - Jog 5 = Iyer loge + A(n), 
din 


from which a standard computation yields 


F(x, a) = > XO) a(n) 


nse 
x(di da) z 
= —loga + B(d,) Jog 
Pa didy d, 
d 
=—loge+ 37 judy) log =. x(ch) | (da) 
qe dj dy 
ady<e dys ze 


The argument used in the proof of theorem 7.4.21 shows that F(x, 2) is 
bounded if £(1, x) ¥ 0. Assume that £(1,x) = 0. Then Abel’s summation, 
formula shows that Dése x) = O(1/z), so the previous formula yields 


F(x,2) = —loge +0 (: Yo to i) 


ay <a 
=-logr+0O ( Hlce= — lee) 


= ~logz + O(1). 
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The conclusion is that 


So FOG a) = (1 — k) loge + O(1). 
x 
But this contradicts the fact that k > 2 and (again by the orthogonality 


relations) rm 
TFs) =n). Meso, 


nse 
n=l (mod N) 


The reader has probably noticed that the crucial part of this argument was 
an “elementary” version of the argument used in the beginning of the proof 
of theorem 7.4.19. 


Chapter 8 


Formal Series Revisited 


In this chapter, we discuss applications of generating functions in com- 
binatorics or combinatorial number theory. This is a rather vast subject and 
we will only be able to scratch its surface through some nice examples. The 
idea is very simple (even though in practice things are less clear): given a 
sequence (@p)n of complex numbers (but we may allow this sequence to take 
values in any ring, in theory), one is sometimes able to extract information 
about the sequence from its generating function Vaso @nX” in an easier way 
than by dealing with the sequence itself. For instance, if the sequence satisfies 
a rather complicated recursive relation, it is very hard to study the sequence 
directly, but quite often it turns out that its generating function satisfies some 
differential equations or some regularity properties that allow us to study the 
sequence. Also, in counting problems it is very often much easier to find the 
generating function of the desired number of objects to count than to find di- 
rectly that number. Of course, one sometimes needs quite a lot of imagination 
to extract the information from the generating function, but it should be a 
principle that if one knows the generating function, then one knows a lot of 
things about the sequence. It is important to note that when dealing with gen- 
erating functions one neglects all convergence and analytic issues. However, 
once one knows the generating function, one usually uses analytic arguments 
to study the sequence. 
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Before passing to concrete problems, let us recall a few things about op- 
erations with formal series. Addition and multiplication are defined just as for 
polynomials. It is an easy exercise to check that a formal series has an inverse 
(for multiplication) if and only if the constant term is nonzero. A more subtle 
problem is the composition of formal series. Here, one has to be a bit careful, 
as simple things such as }7,, (aay do not make sense formally. To do things 
properly, the best way is to introduce the X-adic valuation on formal series: 
if f =ag+a,X+--- is a formal series with complex coefficients (more gener- 
ally, any field), let ux (f) be the least nonnegative integer n such that a, 4 0. 
Thus f - X—*x‘/) is an invertible power series and one easily checks using this 
that vx (fg) = vx(f) + vx(g) and ox(f +g) = min(vx(f),vx(g)). Thus vx 
behaves as the p-adic valuation. 


Definition 8.1. Say a sequence of formal series f,, converges towards a formal 
series f if ux(f — f,) tends to oo. If f = ag +a)X +---, this means that for 
all i, the coefficient of X* in fy, is a; for all sufficiently large n. One says that 
f = Dapo fn, respectively f = T],>9 fn if the sequence (fo + fr +--+ fn)n 
(respectively (fofi---fn)n) converges to f. 


It is an easy exercise for the reader to check that )°,,59 fn converges to a 
formal series if and only if vx (fn) tends to infinity (the analogous result holds 
for p-adic numbers). Also, if f, are formal series such that f,(0) = 0, then 
TIa>o(1 + fn) converges if and only if vx (fn) tends to oo. We can now explain 
the composition of two formal series. Assume that f,g are formal series such 
that 9(0) = 0 and f = ay + a,X +agX?+-+--. The composition f og is then 
by definition the formal series 


fogsagtag+ag? +. 


The series converges, because ux(g") > n. All other operations on formal 
series (such as differentiation, integration) do not involve any difficulty and 
have the properties that we imagine. 

Before passing to problems, let us recall the following useful extension of 
the binomial formula: for any rational number a we have 


a+xe=>0 (f) 


n>o 
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~1)--(o— F . 
where @ = wera fomntl) A very useful special case is a = —m, where m 
is a positive integer. In this case the binomial formula becomes 


Finally, we will sometimes use the classical notation [X "1 f for the coeffi- 
cient of X” in f. 


8.1 Counting problems 


The first exercise is quite technical, but there is no serious difficulty in it 
and the result is quite beautiful. Recall that if x, is a sequence of complex 


numbers, an equivalent for it, is a sequence y, in closed form such that 2a 
converges to 1. a 


1. Let @1,@2,..-,@n be relatively prime positive integers. Find an equiv- 
alent as k -+ oo for the number of positive integral solutions of the 
equation @)2 + gry + +++ -bdndn = k. 


Proof. The generating function of the sequence yg, the number of positive 
integral solutions of the equation aj; + agz2 +... + antn =k, is 


1X) = Tonex* 


b> 
= XU A tantn 


Fp O2 En 2h 


1-xX% 3 


This is a rational function and in order to analyze sequence Ye we will 
decompose it into simpler pieces. Namely, if 1 = 21, z2,...,z» are the distinct 
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roots of the polynomial []j.,(1 — X%), with multiplicities m ,ma,...,7™r, 
then by general theory there exist constants Cj; such that 


F(X) = Coo + ‘aps ay 


j=] jei 


Using the binomial formula, we obtain 


Oa = (-z7!)/. (1 = x)" = (ay at(? ae "xt. 


k>0 


Inserting this in the formula for f and re-arranging terms yields 
rom ; 
i hen) (Frk=1 
m= da, 


& ee ye ~(k+3) Gantt Le (K+ gj ~ 1). 


i=l j=l 


We claim that the dominant term in the previous sum corresponds to 
i= land j =m, =n. First, note that m; <n for all i #1. Indeed, each of 
X* —1 has simple roots, so the only possibility for m; > n would be that 2; is 
a root of all X% — 1, thus also of X —1 and soi = 1. Thus the contribution to 
yx of all terms with i 4 1 is dominated by ck” for some constant c. A similar 
argument shows that the contribution of the terms with i = 1 and j < n is 
also dominated by a constant times k"~?, ‘Thus the main contribution is that 
of the term with ¢ = 1 and j = n, which is so uS (k+1)---(kK+n—1Cin. To 
find Cin, note that 


Cin = lim f(2)(e ~ 1)" = (~1)" lim = 
jamb 


Combining the previous observations yields the equivalent ce Ae ae for yx. 


Ay On 
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The following problem is a bit trickier, but it shows how formal series in 
several variables appear quite naturally. 


2. For positive integers m and n, let f (m,n) denote the number of n-tuples 
(x1, %2,.. sh of integers such that [a1] + [vol +---+|za| <_m. Show 
that f (m,n) = f (n,m). 


Putnam 2005 


Proof. We will use a erste generating function in this problem: with 
the convention f(m,0) = f(0,n) = 1, we have the chain of equalities 


FIXY)= D0 fimn)xmyt= Soxmyve St 
m,n>0 mn20 [Ralt---+]Rn im 
7 Xlaltt) yr : 
pa aes ae rs (5>*) 


n200 kiy--yhn€Z 


Since F(X,Y) = F(Y,X), it is clear that f(m,n) = f(2,m) for all m,n 
Q 


3. For n 2 3and AC {1,2,...,n}, say A is even if the sum of the elements 
of A is an even number. Otherwise, we say that A is odd. By convention, 
the empty set is even. 

a) Find the number of even, respectively odd subsets of {1,2,...,2}. 
b) Find the sum of the elements of the even, respectively odd subsets 
of {1,2,..4,%}, 


Romanian TST 1994 
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Proof. The generating function for the sums of the elements of the subsets of 
{1,2,...,7} is []f.,(2 +X), because of the obvious identity 


Te +X) = yo xm), 


tml A 


where m(A) = }{4e4@- Let E(n), respectively O(n) be the sets of even, 
respectively odd subsets of {1,2,...,n}. Taking X = —1 in the ake, 
identity yields 0 = |E(n)|—!O(n)|. Since we clearly have |E(n)|+ {O@)| = 

we deduce that the number of odd, respectively even subsets is 2°-!. To answer 
the second question, we differentiate the identity, to get 


n 


Sox Tat x4) = So maya, 
A 


i=1 iH 
Taking X = —1 yields 
So om(A)= S2 m4). 
AEO(n) AGE(n) 
On the other hand, taking X = 1 gives 
So m(A)+ S> m(A) = n(n + 1)2"%. 
AEO{n} AGE(n) 


We deduce that the answer to the second question is n(n + 1)2"~9 for both 
quantities involved. oO 


Remark 8.2. Here is an sereeatds Peek for the second part. 


On = On—y + bai > Once, bn = bya tani tn Dene 


for m odd, as the odd subsets of {1,2,...,n} are either odd subsets of 
{1,2,...,2~ 1} or the union of {7} and of an even subset of {1,2,...,2—1}. 
A similar analysis shows that 


On = Wayuy tn 22, by = Wp tng? 
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for even n. By induction, we deduce that an = 6, for all n and then solving 

the previous recursive relation (which becomes a, = 2p. +n-2"~*) yields 
@y, = 2"-Fn(n + 1). 

We continue with a very nice problem, though quite simple. 
4, How many polynomials P with coefficients 0, 1, 2, or 3 satisfy P(2) = n, 
where 7 is a given positive integer? 
Romanian TST 1994 
Proof. Let a, be the number of such polynomials. Then a», is also the number 


of solutions of the equation x9 +2x1+-+-+2,2* = n in integers 2; € {0,1,2, 3}, 
for varying k, Thus, the generating function is 


Yoanx” = TT (1+ x + x2* + x8), 
n>o k20 
This is also equal to 
. 1- x8 . 1-x 
1-X? 1-4 : 


which simplifies drastically to 
1 
Oy XO eee 
> i G-xX)G- x4 


Now, we can decompose the last rational fraction in simple elements, following 
the standard procedure. A simple computation yields 


i af a : i i 

@-x)a-xy 4 (x Tx) + oq Ky 
Expanding this once more, we deduce that the answer is iB | +1. 
Note that we could have avoided the simple elements decomposition, be- 
cause Wexaexy is simply the generating function for the number of ways 
to express m as @+ 2b, where a,b are nonnegative integers. There are 13] +1 
possible choices for 6, each of which leaves one choice for a. So there are 
(BI + 1 ways to express n as a + 26 and the conclusion follows. Oo 
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Remark 8.3. Here is ancther nice solution: if P(X) = ag + aX +--+ agX* 
satisfies P(2) = n, define 

, [ek] ok 

+ | | 2”. 


sP)= [F141] 2+ 


We obtain a map f from the set of polynomials P as in the statement of the 
problem, to values in {0,1,..., | |}. Tt is easy to check that this map is a 
bijection. 

In the next two problems, one combines an easy combinatorial argument 
with a generating functions argument. Such mixtures appear very often in 
combinatorial problems and in this case generating functions do the dirty 
work of solving quite complicated recursive relations. Let us start with an 
absolute classic. 


5. In how many different ways can we parenthesize a non-associative prod- 
uct 21£9°++ 2p? 
Catalan 


Proof. Let a, be the desired answer. Note that in a product of k, respectively 
n~ k factors, we can put parentheses in ax, respectively ¢n_, ways. Looking 
at the position at which the first parenthesis ends, we deduce the recursive 
relation a1 = 1 and aq = SORZ} ap@n_n. 

Next, define f(X) = Yast @pX” and observe that the recursive relation 
implies the chain of equalities 


‘nl 
{xP =k? +S (Sim) X” = f(X)~ X, 
nz3 \k=1 


which implies (taking into account that (0) = 0) that 


=5-50-4x} 
1 1 n(4U2\ jnyn 
=575 es yn (Manx 
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Now, 
1/2 Be 
e / \e 14a (-1)". 1-3 te tl 1)" 9 = 2 (2n~2 
n} “n\n-1 
and so, using the previous relation yields a, = 3 baa i] 


6. Let F(n) be the number of functions f : {1,2,...,2} -> {1,2,...,n} 
with the property that if 7 is in the range of f, then so is j, for all j < 4. 
Prove that 


L. Lovasz, Miklos Schweitzer Competition 


Proof. The key point te obtain a recursive relation for the sequence F(n) is to 
look at |f~1(1)}. If [f~1(1)| = 7, then the j elements of f-1(1) can be chosen 
in G ) ways and f can be defined in F(n—j) ways on the remaining elements. 


Thus, there are (")(n — 7) maps f satisfying the property in the statement 


of the problem and for which |f~'(1)} = 7. Summing over all possible values 
of 3, we cover all functions f satisfying the property in the statement, so 


n n=l 
mh 5 n , 
=> (i)Fn-a=¥ (") ro. 
ja joo MF 
Here we took by convention F(0) = 1. Considering the exponential generating 


function f(X) = S35 E@) X", the previous relation yields 


F(X) =14 (€% DF) = 10 = 5 1 


Next, we can write 
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Expanding now 

aX nX (nx)? 

eh ze Lt TT + 3 
and identifying coefficients in the previous equality yields the desired for- 
mula for F(n). Note! that we cheated a bit, since D1, Gr does not 
converge X-adically, but this can be easily fixed: consider the function 
f@)= S30 EQ) 27, defined in a neighborhood of 0 in C. This series con- 
verges absolutely and the previous computations show that f(z) = 54, if z 
is close enough to 0. But then we can expand 


sist 


rd 


€ 


n>0 


where this time the series converges in C. Expanding e”’, collecting terms 
and identifying coefficients yields the result. oO 


We end this section with two more challenging problems. The first one is 
a very classical result in the theory of finite fields. 


7. Let N, be the number of irreducible monic polynomials of degree n with 
coefficients in Z/pZ. Then for all n we have S)q), dNa = p*. 


Proof. Write F, = Z/pZ and consider the generating function 


Fe SO xtes, 


SEF p|X] 


the sum being taken over monic polynomials f ¢ F,[X]. As there are p” monic 
polynomials of degree n with coefficients in F,, we have 


"y 
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On the other hand, the unique factorization of monic polynomials into 
products of irreducible monic polynomials yields 


1 


Xdegh? 


pa [[e xe 4 x20 TT 
Ah R 


the product being taken over the irreducible monic polynomials h. Taking the 
formal logarithm yields 


i 1 1 
log ie SY log syaak 37 Nn log Sit 
h n>0 
The desired formula is easily obtained from this equality and the classical 
expansion 
1 xk 
log —~; = s —_. 0 
An-X Psi k 


Remark 8.4, Using Mébius’ inversion formula and the result of the previous 
problem, we obtain 


i B 
Np = s uld)pe. 
din 


It is fairly easy to see from here that N, > 0, so there exists at least one 
irreducible polynomial of degree n over Fp, This result is absolutely not trivial. 
There is also an arithmetic solution of the previous problem, the idea being 
to prove the stronger result 


T] f=x?-x, 
fe®,{X], 
deg fin 


where the product is taken only over those irreducible monic polynomials 
f © F,[X] whose degree divides n. 
8. Let ¢ and y be noncommutative variables. Express in terms of n the 
constant term of the expression (e+ y+a@71 4 y™1)", 
M. Haiman, D. Richman, AMM 6458 
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Proof. Note that variables cancel in variable/inverse pairs so the constant term 
is zero for n odd. Let Gm» be the number of products uyug-+- tn where each 
u; is one of {z,y,27!,y~1} and after cancellation we are left with a word of 
length m. Note that ag, is the desired constant term. By convention we set 
@14n =. Starting from a word of length m > 0 there are three ways we 
can add one more term on the right and produce a word of length m+ 1 and 
one way we can add a cancelling term and produce a word of length m — 1. 
Therefore we have 


5 — [3m-tn + Omsin MAL 
monet 4a0.n + Gan mal 


Letting An = S779 @mnX™ we can rewrite this as 
Ans = (8X +X) An + aon(X ~ X74). 
This is a first order linear difference equation in A, and it can be solved by 


standard techniques to give 


n-1 
= (BX +X" + SV donemes(X — XA)BX +X)", 


m=0 


By definition it is clear that A, is a polynomial in X, therefore the coefficient 
of any negative power of X in this series must be zero. In particular, setting 
n= 2s +1 and looking at the coefficient of X7} gives 


23-41 2m m-if 2m 
HOT) = Zorn [Gn 
Forming generating functions of this identity gives 
o oo co 
et" + "rs = 3 a) _ gt 2m )| T™ 3 aoaT" 
8 m mo , 
a=0 rax0 r=O 
Let a(P) = 37°25 ao,arT™. Recognizing the other sums via 


PE oe ase Se ryn(- MN gm = pata, 


mad 
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(* + ") I ie + ’) 2m m+ 2m 
=> and ~ 
8 2\s41 m~1 m is i 
we can write this as 


_ 1 i = 1 1—6T 1 " 
eVi-isr  6r ~ (+ “ToT” ish Vi 1oT * ier) att): 


Solving for a(T’) and simplifying gives 


a(T) _3 _ _ Wi-T-1 
° 142/1- 127 1-16T 


Extracting the coefficient of 7” using the binomial formula gives 


ana 2s gttr—8) _19y8 Go) _ tr 


k=0 


z ofr = m4 g(r k)+2gk (op 2 
k k-1 


ket 


8.2 Proving identities using generating functions 


Generating functions are a very powerful method for proving combinato- 
rial identities. Namely, we compute the generating functions of both sides of 
the identity we are given and we prove that they are the same. Here are a few 
examples. 


9. Prove that for all positive integers n, 
n 
n+k— : 
> = Fon, 
k=1 ( ea 
where F,, is the Fibonacci sequence (with Fy = F) = 1). 


Iranian Olympiad 2008 
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Proof. The generating function of the left-hand side is 


Er E(s)-EEe(a) 


nzi kd k>1n>k 


On the other hand, we have 
n+k-1 BRO (R42 ATW ns nae 
Ex( )-"5( X" = XML x), 
= -1 
Sh 2k-1 Berd 2k 


by the binomial formula. 
So the generating function of the left-hand side is 


Xx 
S k OR cp 
20%) ~ X?—3X +1 


It remains to compute the generating function of the sequence Foy. This is 
very easy, since we know that F,, = *.-%-, where z,y are the roots of the 
equation ¢? —t—~1=0. Thus 


SOX" Fon 


n2i 


(e~ y)(1— a? X)(1—y?X)" 
Since x+y = 1, cy = —1, we easily deduce that 
(L—#?X)(1—y?X) = X*-3x41 
and the result follows. im 


10. Let n and k be positive integers. For any sequence of nonnegative in- 
tegers (a1, @2,...,@%) adding up to n, compute the product azag-+: ag. 
Prove that the sum of all these products is 


n(n? — 17)(n? — 27) .-» (n? ~ (k — 1)?) 
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Proof. Let f(n,k) be the desired sum of products and consider the generating 
function 


KX) = SO fa kx"= SOS 


n>o N20 G1 +agen 


482 -++ aX” 


By 421.0420 i>o 
Xo \F 2 
= (agp) -xt0-20, 


Expanding (1 — X)~** thanks to the binomial formula and using the previous 
relation, we deduce that 


f(r, k) = (-1)"-* ee as ee a a 


On the other hand, it is easy to check that 


n(n? — 1?)(n? ~ 22)... (n2 ~ (k ~ 1)*) _ {ntk-1 
@k-Di ea ) 


from where the result follows. 


Remark 8.5. Here are a few other such identities that can be easily proved 
using generating functions: 


a) paar eccrine @102°++ a, = Fo,, where the sum is taken over all or- 
dered partitions of n and F, is the nth Fibonacci mramber. 


b) artarttaymn (2 oo 1)(20-1 ae 9 iia (2e—4 —1) = Fonz 
We continue with a very nice combinatorial identity, for which we give 


two proofs: a natural one using generating functions and a more subtle, purely 
combinatorial one. 
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ll. Let m,n be positive integers with m > n, and let S be the set of all 
sequences of positive integers (41, @2,-..,@n) such that a +aqt- + +4, = 


m. Show that 
n ln 
41942 | Gn on pyri im 
S72 1m ats ent = SO(-1) (i 


{@1)...Gn)ES isl 
Palmer Mebane, USA TST 2010 


Proof. The solution using generating functions is rather straightforward. In- 
deed, note that: 


xm, ( s poigae.. oe) 


man Aye bane 


li 


s XY 2X)” ae (nX)% 


i 
a 
* 
ult 
& 


»¢ 2X nX 


On the other hand, we have 


yo x (S-9(1)") = dene (") 


man 


a~xXja- 


But the theory of simple fractions decomposition shows that there exist 
Aj, Ao, .--,An such that 
n 


nt 
(1— X)G —2X)-.-G—nX) ao 2 


t=1 


Aj 
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Multiplying this by 1—7X and taking X ~> 2 yields the expression of A; and 
a trivial computation shows that A, = (~1)"-#(?)i". cg 


Proof. Define k = m~n and let T be the set of sequences of nonnegative 
integers (6;,62,...,0n) such that by + by +--- +b, = &. The substitution 
b; = a; ~ 1 transforms the desired identity into 


nm 
nh SD 1a. nde = de-pretn("). 
a c j 
T jal 


To prove this, we will count in two ways the colorings of k + n objects with n 
colors such that each color is used at least once. 

The first method of counting is to line up the objects in some order and 
to consider the first appearance of each color. Suppose object ¢ is the first 
appearance of the ith color (in order of occurrence). Let bj = ¢ia1 —G — 1 
(with p21 =n +k+1) be the number of objects between ¢; and Gy1. Any 
choice of (b1,b2,...,5n) with by +b +-+++by = k also gives a valid choice 
of the ’s via q = i+ Dyat b;. Now for the b; objects in between the first 
appearance of color ¢ and color i + 1, there are i ways to color each one since 
only ¢ colors have appeared so far. Therefore, for each choice of (by, by,... bn) 
the number of ways is 122%... n>, Summing over all possible cases and 
taking into account that the order the colors appear in can be rearranged in 
nt ways, we get n! S>p 152% ...n'e ways to color our ob jects. 

On the other hand, there are n*+" ways to color k +n objects with n 
colors and there are (,",)(n — 1)**" ways to do the coloring with n — 1 of 
the n colors, (,"4)(n — 2)*+” ways with at most n — 2 colors and so on. A 
standard inclusion exclusion argument shows that the number of admissible 
colorings is 


(")nten— ("inti eayea(D)aten 4 caye(*) 


and we are done. 
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8.3 Recurrence relations 


Generating functions are an extremely powerful tool to deal with the 
complicated recurrence relations that appear quite often in enumerative com- 
binatorics. The reader is warned that the problems in this section are rather 
technica] and difficult. 


12. Let Ay = @, By = {0} and 
Anyi = {1 +2) 26 Bn}, Bayi = (An \ Bn) U (Bn \ An). 


Find all positive integers n such that B, = {0}. 
Chinese Olympiad 


Proof. Using the relations given in the statement of the problem, it is imme- 
diate to check that 


TBaii(k) = 1a, (kb) +1p,.(k-1) (mod 2), 


where 14(z) = 1 if 2 € A and 0 otherwise. This relation suggests considering 
the sequence of polynomials defined by 


Py=0, Py=l, P(X) = Paui(X) + XPh-a(X). 


Looking at the two recursive relations, it is clear that, modulo 2, the coefficient 
of X* in P,(X) is simply 1g, (k). To solve this recursion, it is convenient to 
introduce a new variable T and set X = T+ T?. To avoid confusion let 
Qn(T) = P,(T + T?). Then this relation becomes 


QnlT) = Qn-1(T) + T+ T)Qn-2(T). 
This relation is easy to solve mod 2 giving 
Qn(T) = (T +1)? +7" € FA[Z). 


It is obvious either from the original recursion or because Q,,(T) = Qa(T +1) 
(mod 2), that Q(T) can actually be written mod 2 as a polynomial P,(T+T*). 
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Explicitly finding P, might be tedious, however we do not need to do so since 
we only care about when B, = {0} or equivalently P,(X) = 1 or Qn(T) = 1. 
It is easy to see from the formula above that this occurs if and only if n is 
a power of 2. More explicitly, from Legendre’s formula it follows that the 
smallest k > 0 for which ({) is odd is k = 2%"), Therefore Qn(T) has degree 
n~ 2) as a polynomial in T and P,(X) has degree 3 (n— 2") as a 
polynomial in X. Therefore Q,,(T) = 1 if and only if n is a power of 2. 


Proof. As above, we consider the sequence of polynomials defined by Po = 0, 
P, =1 and 

P(X) Feast Pra(X) + XPy-2(X)} 
and we observe that, modulo 2, the coefficient of X* in P,(X) is simply 1g, (k). 


We consider the generating function P(X,Y) = nop Pn(X)¥® and observe 
that 


(1~¥ — X¥*)P(X,¥) = Po(X) + (P(X) - P(X))Y 


+0 (Pal X) — Proi(X) ~ XPhio(X)¥" =¥, 


n=2 
so we get 
x 
P(X, Y) = —__— 
Gh) 1-Y—xXyY? 
= Soya xy)" 
mal 
foe] m 
a > > (") Xiymesti 
mao ja \F 
pap ee ‘xer. 
n=O j 3 


Thus the problem is reduced to finding all n such that (7374) is even for 
all n > 7 > 0. We will prove that this happens if and only if n is a power of 2. 
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Consider the representation of m in the form 02" where o is an odd number. 
If n is not a power of 2, then o > 3 and we have (""2--4) is odd. If n is 
a power of 2, then we need to prove that C3") is even no matter which j 
we choose. Using the Legendre formula we are done if we can find an m for 
which | =| > | =p | au |e . We can choose one more than the binary 
logarithm of the highest power of 2 that divides 7. Thus the conditions are 
that 2 is a power of 2. 


The following problem is very technical and its solution is too. But these 
kind of problems appear quite often in real life and we think it is rather 
important to be able to deal with them. 


9 13. Suppose that a9 = ay = 1 and (n+ 3)anga = (2n + 3)an + 3nap—1 for 
n > 1. Prove that all terms of this sequence are integers. 
KéMaL 
Proof. The first step is to consider the generating function 
£(X) = So ay”, 
neo 

Multiplying by X” the recursive relation and adding these relations yields 

Sn + B3)ang1X" = San + 3)anX”" + SS 3ndn 1X”. 

n> n>t nl 
Now, since f’(X) = Yyasy nay,X*-1, it is very easy to express each of the 


previous sums in terms of f and f’. More precisely, a straightforward compu- 
tation yields 


Slr + 8)ansX" = £1) + ZH) —1)~3, 
n>i 

So (2n + 3)anX" = 2X f'(X) + 3f(X) ~ 3 and 
nat 

ST Brana X™ = BX f(X) + 3X? /'(X). 


nazl 
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Replacing these expressions in the previous equality and collecting similar 
terms, we end up with the differential equation 


f'(X)(X — 2X? ~3.X3) + f(X)(2— 3X ~ 3X?) -2=0. 


Now comes the technical part, which we will skip: solving this differential 
equation, There are standard methods to solve this, but unfortunately when 
one uses them in this case, one obtains rather horrendous expressions. Thus, 
we leave to the reader to convince himself that the resolution of this equation 


yields 
f(X)= L-X ~ V1 ~ 2X — 3x? 
2X? . 
And now? It is absolutely not clear that the coefficients in the Taylor expan- 


sion of f are integers! The tricky point is to write 
(1-X —2X*f(X))? = 1-2x —3x? 
exh 1+ (X — 1) F(X) + X°F(XP = 0. 


And now we are saved, because if we identify the coefficients in the previous 
relation, we obtain the following recursive relation 


nm 
Gn42 = Aney + Ye aK an—x. 
k= 
And since the first terms are integers, the previous relation shows inductively 
that all terms of the sequence are integers, Another elegant way, found by 
Richard Stong, is to note that 


a X?k 
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and that ¢ty CH = GY) - GS) is an integer (this is the famous Catalan 


number). We leave as a challenge to the reader to prove directly from the 
recursive relation that : 


mR 
On42 = Onq1 + Ss OpGn—k 
k=0 


3 -— 1 (2) 
mS . 
oe k+l k/ \2k 
He will probably appreciate the power of generating functions (as far as we 
know, the proofs by induction are hideous, to say the least... ). oO 


or that 


Remark 8.6. The numbers appearing in the previous problem are called 
Motzkin numbers and they have nice combinatorial interpretations. Here is 
one of them: a Motzkin path is a lattice path from (0,0) to (nm, 0) with steps 
(1,0), (1,1) and (1, ~-1), never going below the z-axis. Then a, is the number 
of Motzkin paths of length n. This is a beautiful and not obvious exercise that 
we leave to the reader. Another good exercise is to prove that a, is also the 
number of paths on N with n steps, each step being —1,0 or 1, starting and 
ending at 0. To make everything precise, let us recall that if A is a subset of 
Z”, then a lattice path of length / from x € Z" to y € Z", with steps in A is 
a sequence vp = @,01,...,% = y of elements of Z” such that vj — uj € A for 
all i. 

We give three different solutions for the following beautiful problem. The 
first. two proofs are a bit technical, but quite natural. The third one is very 
short and elegant, but not very natural. 


qY 14. Consider (bn)n>1 a sequence of integers such that 6; = 0 and define 
ay = O and ay = by + aybny + +++ + Gn—1by for all n > 2. Prove that 
play for any prime number p. 


K6MalL 
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Proof. Considering the generating functions 


A(X) = So anX",  B(X) = Son X", 


n>2 n>2 


the recursive relation can also be written in the form 
A(X) = XBY(X) + A(X)B(X), 
from where we obtain the fundamental identity 


XBYUX d 
A(X) = ay = ~X Fy log(1 - B(X)). 


Now, we have the following very useful result: 
Lemma 8.7, Any f € Z([X]] with constant term 1 can be written in the form 


t=][0-ax) 


é=1 
with a; € Z. 


Proof. Write 
FX) =14+AX+ f_x*4-. 


(1~a)X)---(1~anX")f(X) = 1 (mod X”), 


This expresses a,, as a polynomial with integer coefficients in aj,...,@,—1 and 
fi,--.s fn, from where the conclusion follows immediately by induction. © 


Using this result, let us write 


1~ B(X) =[] (@-ex*)* 
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with c; € Z. Then 


A(X)= we log(1 — ¢X*) 


izl 
--y 
1—¢X* 
i>t 


=o Sie X* +2 X74 --.), 
1 


Since ap is the coefficient of X? in A(X), the only contribution comes from 
the terms with i = 1 and i = p and since the contribution for i = p is clearly 
a multiple of p, it is enough to check that c; = 0. But this is clear, since 


1~ B(X) =1 (mod X°). ii) 
Proof. Let us modify the definitions of the generating functions a bit, by 
imposing different initial terms. Namely, define ag = a) = 0,b9 = —1,0; = 0 
and 


oo mm . 
A(X) = Sha: X#, BUX) = $7 —-6,x¢. 
imd t=O 


Then the recursive relation satisfied by the sequences implies that 
A(X) B(X) = ~XB'(X), 
so that 


A(X) _ BY(X) 
X  B(Xy 


Integrating both sides we get 


ies) ye ; 
do AE = tog (1 = (0X? + 5X84 +). 


im? 


Since ee 
log(l ~ X) = —X ~ = ene 


br=Galor 
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ize tomer ZrO 


we deduce that 


con |B ' Bape }o ne . 2 ns + 
of XP in S152) 4 +)" Since the denominator of this coefficient. is 
clearly relatively prime to p, it follows that plap. 


Proof. Let. 1, %2,..., Zp be the roots of the polynomial 


XP by XPO! + bo XPW? 4 bg XPO3 4. 


++ bp. 
Comparing the identities @, = 6) and ay, = nbp + a1bnoa +++) + an—1b1 for all 
n > 2 with Newton’s identities, we easily deduce that 


sa cal 2 é 
Oy = B+ EQ +a, 


for every i € {1,2,...,p}. On the other hand, 6; = 0 implies that 
DpH La+--- + 2p =0. 


The desired result follows then from corollary 9.15. Q 


8.4 Additive properties 


When studying additive properties of sets A C Z, it is very useful to 
consider the generating function f = }7,-4 X*. For instance, the square of f 
encodes the number of solutions to the equation a+b =n, with a,b € A. This 
rather innocent-looking observation yields quite a lot of nontrivial results and 
the purpose of this section is to present a few of them. 

Quite often, it is more convenient to reduce the generating function mod- 
ulo suitable primes, as this simplifies a lot the computations: it is a very 
fortunate feature of F, that (1+ X)? = 1+ X? in F,[X]. 
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Wis. Let A be a finite set of nonnegative integers. Define a sequence of sets 
by: Ap = A and for all n > 0, an integer a is in An4; if and only if 
exactly one of the integers a—1 and @ is in Ap. Prove that for infinitely 
many positive integers k, A; is the union of A with the set of numbers 
of the form k+ a with a € A. 


Putnam Competition 2000 


Proof. The key point is to note that the definition of Ay41 in terms of A, can 
be expressed algebraically by 


SS x*=(14+.x) So x 
BEAR aAy 
for all n, the equality being in F)[X]. We deduce that 
So X* = (14 xX)" 5° xt 
a€An aca 


for all x. On the other hand, the condition that A, is the union of A and 
A-+n can be also written (when n > max A) in the form 


SOX? = (14 X") $7 x8. 
aGAn acA 


Thus, it suffices to find infinitely many n such that (1 +X)” = 1+ X” in 
F,[X]. Simply take for n a power of 2 greater than max.A and we will have 
Ag = AU(A+n). 


The following problem is an absolute classic. 


16. Prove that if we partition the set of nonnegative integers into a finite 
number of infinite arithmetic progressions, then there will be two of them 
having the same common difference. 


Proof. Suppose that N is partitioned into the arithmetic progressions a;N + 6;, 
with aj, > 0 and aj < ag <--+ <a,. We will actually prove that an} = an. 
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Suppose that this is not the case and note that the partition hypothesis yields 
an identity of formal series 


1 ba n xe 
Tox 7 = ee 


i=1 k20 i= 


However, the right-hand side has a pole at a primitive a‘? root of unity, while 
the left-hand side definitely does not have such a pole. 


The following problem is much trickier. 


¥17. Let p bea prime and let n > p and a1,49,...,@, be integers. Define 
fo = Land fy the number of subsets B c {1,2,... 7} having k elements 
and such that p divides 57,2 @:. Show that fg — fi + fo—++-+(-1)" fa 
is a multiple of p. 


Saint Petersburg 2003 


Proof. Adding to all of the a;’s a suitable large multiple of p (which does 
not affect the hypothesis or the conclusion), we may assume that the Q;’s are 
positive. The point is to consider the remainder of 


f(X)=- XxX) ~X%)...(1- xX) 
= 0-1 SS x) ¢ FLX] 


k=0 BCA 
[Blak 
modulo X? ~ 1, where m(B) is the sum of the elements of B. On the one 
hand, the hypothesis n > p and the fact that 1- X divides 1— X imply that 
1~ X? = (1 ~ X)? (remember that we are working in F,[X]) divides f(X). 
On the other hand, we have X" = X™ (mod X? ~ 1) if N = M (mod p), 
thus 


pri fin 
f(x) 2° | (1) yy 1] X? (mod XP ~ 1). 
j=0 \k=o iBl=k,m(B)=j (mod p) 


Since this polynomial is 0, it follows that its constant term is 0 in F,. But this 
is precisely saying that p divides fo ~ fy + fo ~-+-+(—1)"fy. im 
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The following is also a rather tricky problem. We found it in the wonderful 
little book [62]. It was also proposed in a Chinese Team Selection Test in 2002. 


Fis. For which positive integers n can we find real numbers @i,42,..., Gn 
such that 


4, i n 2 
{la -aj) |1si<jgn}=41,2,..., alt 
Proof, It is clear that if a; are such numbers, then all their differences are 
integers, thus we may actually assume that they are integers. Let 
S(X) = XM 4 XH XM, 


so that the hypothesis can be written 


G  & xG)+1 _ x-() 
Af (ze) =n Bet ree aed Ske : 
i=l i= 


The point is to look at values of f at points on the unit circle, since for 
|z| = 1 we have f(4) = f(2) = f(z), thus f(z)f (4) = [f(z)*? = 0. We deduce 


that for any z = e* we have 


We will take x such that (n? — a+ §= & to deduce that 


1 2(n®—n+1) 


ee ene mnie 
aw 3x 
sin 3 3r 


However, it is immediate to check that the last inequality cannot hold unless 
n <4. And indeed, for n < 4 such sequences exist. For n = 1,2 things are 
clear, while for n = 3 and n = 4 one can take the sequences 1, 2, 4, respectively, 
1,2,5,7. D 
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Proof. It is easy to build examples with n = 1,2,3, and 4. Suppose n > 5 and 
without loss of generality that a1 < a2 <-++ < ap. Then the largest difference 
must be dy ~ a; = (§) and since 


n=1 n-i 
On ~ Gy = So (any — Gp) > - ko C 
k=l k=1 


we see that: the differences ay4.1 —a, for 1 < k <n—1 must bea permutation 
of 1,2,...,n—1. In particular, all differences of a;’s with indices two apart are 
at least n. Suppose @;,44 ~ @, = 1, then if they are present, both az.9 — ag = 
1+ (@ps2 — @ga1) and ayy, ~ apy = 1+ (a ~ ax_1) would be at most n, 
a contradiction since they are at least n and distinct. Thus the difference 
of 1 can only occur at one of the ends a2 — a, or an — ni and it must be 
adjacent to the difference of n ~ 1. Since this accounts for the difference of n 
all remaining differences of a,’s two apart must be at least n + 1. Therefore 
repeating the above argument. shows that the difference of 2 must also occur 
at one of the ends and can only be adjacent: to the difference of n — 1. But 
this is impossible since the ends are at least n~1 > 4 apart. a 


The following gem is one of our favorite problems. 


19. Find all positive integers n with the following property: for any real 
numbers @1,2,...,@,, knowing the numbers a; + aj, i <7 (but not 
knowing which number corresponds to which sum) determines the values 
@1,42,--., 0, uniquely. 

Selfridge and Straus 


Proof. We will eventually prove that the answer is: all positive integers but 
the powers of 2. 

First, let us give a nice counter-example when n = 2*. Let A; = {0,3} 
and By = {1,2} and define inductively 


Aji = AyUQ14B)), Bir = By U (M4 + Aj). 


It is an amusing exercise left to the reader to check that Aj, B; have 2’ elements 
and that 


{a1 + aglay # a2 € Aj} = {bi + dolby # bg € B;} 
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for all j. Actually, A |; and B; are precisely the sets of numbers having at most 
j+1 digits when written in base 2 and whose sum of digits is even (respectively 
odd). All this can be easily proved by induction and shows that Ax, By are a 
counter-example for n = 2*, 

Let us prove now that if n is not a power of 2 and if the collections 
(ai + ag)icg and (6; + b;)icg are identical, then the collections (a;); and (b;); 
are identical. This is the hard part. We may always assume that a;,b; are 
positive real numbers, by adding to each of them the same large integer. 

Assume first that the a;’s and b;’s are integers and consider the polyno- 


mials ra 


F(X) = POX, g(X) =I xe, 
f=] tok 


Then the hypothesis implies the equality 
F(X) ~ F(X?) = g(X)? — 9 X?). 


If f = g, then we are done, so assume that h = —g is not the zero polynomial. 
Write h(X) = (X ~ 1)*p(z) for some polynomial p with p(1) 4 0 and some 
& > 1 (note that f(1) = g(1) =n). Then 


(X ~ 1) p(X)(F(X) + G(X) = (X? = 1) p(x), 
which, after division by (X ~ 1)*, can be written as 
P(X)(F(X) + g(X)) = (X + 1)*p(X?), 


Taking X = 1 in this identity and dividing by p(1) ¥ 0, we obtain that 
n= 2F-1_ Since this is a contradiction, it follows that f = g and the result is 
proved for integers. It follows trivially that the result also holds for rational 
numbers (by multiplying them by a suitable positive integer we can make all 
of them integers). 

In order to prove the general case, we will use Dirichlet’s approximation 
theorem. Suppose that n is not a power of 2 and that the two collections of 
numbers (a; + aj)icj and (bj + bj)icj are the same. Pick any ¢ < je 

By Dirichlet’s approximation theorem, we can find integers p, gi, rj such 
that p # 0, |pa; ~ g;| < ¢ and Ipbi —ril <€ for alll <i<n. By the triangle 
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inequality, any equality a; + a; = 6 + 6; forces g + qj = Ti +r; (indeed, we 
have 


% + dj (Te +05) = Gi — Dai + 45 ~ pag — (re — phi) — (15 ~ pd;) 
and each term has magnitude smaller than 1 /4). Hence the two collections 
(G+ a)icj aud (rj + ;)zcj are the same and so (by what has already been 


done) we have an equality of polynomials TM (X — @) = TT i(X -—1j). This 
can also be written as 


n Ses cute n 
II (x04 = #) =H (x-0+ BoP), 
i=1 P i=1 P 

By taking ¢ smaller and smaller, we obtain SEQUENCES ty, Up, Which converge 
to 0 and such that []T/_,(X —a;+un) = Tini(X — 6; + vp) for all n. It is clear 
that this forces []#_,(X ~a;) = ini (X — b;), which is the desired result, 


The following alternative proof is a very elegant argument due to Selfridge 
and Straus ([70]): 


Proof. Assume that n is not a power of 2. Let us prove that the knowledge of 
the multiset (a; +a yji<g uniquely determines the symmetric elementary sums 
of the a,’s (this in turns uniquely determines the multiset (a;);, yielding the 
desired result). Using Newton’s formulae, it is enough to prove that we can 
recover the sums S, = ak + ak +...4+a* from the multiset (a; + 43 )icj, and 
this for every k (it would be enough to take k < n). Note that 


"  Ya+aj)*= 37 (a +a;)* 25, 


ify IStggn 
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As 2n—2* # 0 for all k, the previous relations show (by induction on k) that Si, 
can be uniquely determined as a linear combination of the sums S7,. ;(a:+a,)* 
and So, 5y,...,S,_1. This is precisely what we needed. i= 


The last two problems of this chapter are very hard. The first one requires 
a few facts about polynomials over finite fields, for which we refer the reader 
to the addendum 9.A. 


Y20. Prove that there exists a subset S of {1,2,...,n} such that 0,1, 2,...,.2—1 
all have an odd number of representations as « — y with x,y € S, if and 
only if 2n — 1 has a multiple of the form 2-4* — 1, 


Miklos Schweitzer Competition 


Proof. Define f(X) = y,cgX*~'. The fact that S satisfies the property in 
the statement is equivalent to the following equality in F2[X] 


XP" 1 F(X) f () S14 X peep XMM, 


Indeed, the coefficient of X? in f(X)f (4) is the number of representations of 
a as a difference of two elements of S. Conversely, if we can find a polynomial 
f{ € F)[X] satisfying the previous equality, we can define a set S with the 
desired property simply by saying that s € 9 if and only if X*—! has coefficient 
lin f. 

Thus, we must find n for which one can find f € F2[X] satisfying the 
previous relation. A first crucial remark is that 1+ X + ----+X?"-? has no 
multiple root in the algebraic closure of F2. Indeed, this is already the case 
with X2"~! _ 1, since this polynomial is relatively prime to its derivative. 
Next, the roots of f are precisely the inverses of the roots of X"—!f (4). We 
deduce that no irreducible factor of 1 +X +-+-+X?"~? can vanish at z and 
1/z for any root z of 1+ X +---- 4+ X22. But conversely, if this property 
holds, we can find such f. Indeed, we can then pair the irreducible factors of 
L+4+X + -.- ++ X"-? such that in each pair the roots of the first polynomial 
are the inverses of the roots of the second polynomial. One can then take for 
f the product of the first components of these pairs. 
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Now, consider the permutation z —} z? of the roots of X?"-! — 1. 
lf C),...,C; are the cycles of this permutation, the irreducible factors of 
X?"~1 _ 1 are the polynomials Tleec,(X — 2). We deduce that the existence 
. Si is equivalent to: for any root z 4 1 of X?e~1 — 1, oe is no k such that 
= 1/z. This is equivalent to (X?"~! — 1, X#*+1_-1) = X —1 for all k >-0 
ta so therefore ged(2n — 1,2* +1) = 1 for all k > 1. Leia the least s such 
that 2n — 1|2* — 1 is odd and writing s = 2k + 1 we are done. Q 


Rather strong analytic skills are required to prove the following result, 
which we found in chapter III of the excellent book [62]. It is there considered 
an “appetizer.” 


21, Let A be an infinite set of positive integers. Let z, be the number of 
pairs (a,b) € Ax A such that a < b and a+b =n. Prove that the 
sequence (2, )n is not eventually constant. 


Donald J. Newman. 


Proof. Consider the generating function f(X} = vaca X*. Note that f(z) 
converges for all |z| < Land z -+ f(z) is a continuous function on the open unit 
disk. Moreover, the hypothesis that z, is eventually constant is equivalent to 
the existence of a constant c and of a polynomial P such that 
P(X — F(X?) = $e + P(X), 

We will prove that this cannot happen. The idea is to look at the behavior 
of f on the real axis, close to 1 and then at its average behavior on circles of 
radius tending to 1. To avoid introducing too many functions, we will write 
f >> q (when f,g are positive functions on (0,1)) if there exists a constant 
¢ > O such that f(r) > g(r) for all r in a neighborhood of 1. In particular, 
we have IPI << 1 and since f(x?) a 0 for 0 < # < 1, we deduce that 

f(z? 2 $+ Pla) and so f(x) >> 5 we So f grows quite fast on the real 
axis, mene of 

Now we integrate the relation satisfied by f on circles of radius close to 

1. Using the triangle inequality, we deduce that for all r € (0,1) 


Qn 
if lf (re) Pde < af LFG@re) Une ane |P(z)| + cF(r), 
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where F(r) = 3 i+ Using Parseval’s identity we obtain 


a aR fre) Pda x > pet — T(r?). 
To 


aca 


On the other hand, Cauchy-Schwarz inequality and Parseval’s identity yield 


Qn f fa 
xf "pote idx < anf, Morte iPae = VF) < VF 


Qn 
Putting these remarks together, we obtain the estimate: 
f(r?) < VF?) + C1 + CaF(r) 


for some constants C,,C2 > 0 independent of r. This immediately yields 
f(r?) << 1+ F(r). If we combine this with the result established in the first 
paragraph, we deduce that eel << 1+ F(r). It remains to prove that this 
is not the case. Note that for 0 < 2 <7 we have 


[1 re™ |? = (L~r)? + dr sin?(a/2) 2 (1 ~r)* + 4r2?/n?, 


hence 


i 


1 ny ( 2V7 
MG aresinh ( ) . 


It follows that lim,_,)- Es < 5 and hence 
lim V/l—rF(r)=0. a 
rer 


Remark 8.8. With basically the same techniques, but with more work, one 
can prove the following beautiful theorem of Erdés and Fuchs: 
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Theorem 8.9. Let A be a set of positive integers and let r(n) be the number 
of pairs (a,b) with a,b € A such thata+6=n. Suppose that for some e > 0 


we have 
r(0) +r(i) +++ + r(n) C4 o( 1 ) 


n+l 
for some constant C. Then C =0. 


nite 
For the origin of this result and a complete proof, see 162). 


8.5 Miscellaneous problems 


The solution of the following problem is quite short, but the problem is 
actually quite tricky. 


22. Is it possible to partition the set. of all 12-digit numbers (leading zeroes 
are allowed) into groups of four numbers such that the numbers in each 
group have the same digits in 11 places and four consecutive digits in 
the remaining place? 


St. Petersburg Olympiad 
Proof. The answer is negative. Assume that we found such a partition and 
let A be the set of all 12-digit numbers. Consider f(X) = 2,24 X*@, where 
s(a) is the sum of digits of a, and let Gi,...,G4 be the groups appearing in 
the partition. By the condition imposed on the structure of each group we 
deduce that <q, ¥% is a multiple of 1 + X + X? +X for any i. Thus 


° k 
14 X4 X74 XF(X) = SOY x8. 


i=] a€G; 
But we can actually compute f in a closed form, since 
xX 1\% 
P(X) = (14 KX ---- 4X9)? = AY) : 


Since this is clearly not. a multiple of 1+ X +X? + x3 (for instance, because 
it does not vanish at i), the problem is solved. oO 
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The following problems are quite challenging. They establish congruences 
using generating functions. The first one is taken from [64]. See also [77], [78], 
[79] for other references and delicate congruences. 


23. Let p be a prime and let d € {0,1,...,p}. Prove that 


pol 

2k ) 
Ss =r (mod p), 
3 fe 


where r = p—d (mod 3), r € {—1,0, 1}. 
H. Pan, Z.W. Sun 


Proof. The key point is to observe that 


ok - ok 
(era) “(+ x) 
(a pyr-t 


es ace wae (x: x) scenes oo 


k=0 


80 


Now, since we are only interested in S (mod p), it is enough to understand 
the previous rational function taken mod p. Since in F,((X)}) we have 


1\” Soak Ve 
(x +3) = (x oe a) : 


we deduce that, S (mod p) is the coefficient of X74 in eek € F,((X)). 


So S (mod p) is also the coefficient of X~4 in 
pe REX PHI 
— X4X141 


X(-X 3 
=A Port x P41) 


= (XP +X? +1) (= xaeth _ Er] : 


n>o n>o 


By inspection of the last product, the result follows. i) 
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The following problem is similar in nature to problem 23, but more com- 
plicated. 


24. Let p > 3 be a prime. Prove that 
prol 
2k 
Oe @) =0 (mod p*). 
kel 
David Callan, AMM 11292 


Proof. Let § = wes wea so we need to prove that S = 1 (mod p*). The 
first key point is to observ that 


(ae 2p? pe 
g=tx xR 4 xy — poy (Lt XP - X 
be) AG +20" = DOL ae 
thus 
S=[xPd 
= (xP! 


The second key ingredient is the following nice 


Lemma 8.10. For all 1 <k < p® we have 


(2) =2(2) on 


Proof. The left- hand side is the coefficient of X* in (1 +X)”. But since 
(L+X ye =14xP (mod p) (this follows by raising to the p-th power the 
equality (1+ X)? = 1+ X? (mod p)), we can write 

(La XY? = 14 X? 4 pA(X) 


for some A € Z[X]. The lemma follows then by taking the square of the 
previous relation and comparing the coefficients of X*, 
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Coming back to the proof, we deduce that S —1 is the same (mod p?) as 
twice the coefficient of X?°~) in 
2 
P +37 
( i ) xk a 


tt) xt = (1-X) > 


pP>ke1jeo 


coe a (8 eee 2 


1sk<p? Bik Nk ptm LS b+ 1 


We need to prove that S; is a multiple of p*. Well, the good news is that 
gin 
S; = 0, since if z= eS, then 


On the other hand, since p? = 1 (mod 6), we have 
L$ 2 sista), 214 aye a l+z, 


which combined with the previous relation shows that S; = 0. The result 
follows. Cl 


The following result is very difficult. It was conjectured by Rodriguez- 
Villegas and proved in a rather difficult way by E. Mortenson. The following 


beautiful proof is due to R. Tauraso, taken from [83]. 


25. Prove that for any p > 2 we have 


p-l 2 
rey) =(-1)'F (mod p*). 


ken 
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Proof. Observe first of all that 


ah (i) _ Tjer(24-1) 


k 


where n= Bet 
We will prove that for all nonnegative integers n we have 


yor(f) (ae) =" 


by computing the generating function of the left-hand side. We have 


ESC) C3)--EOzrcs 


n>0 \k=0 k20 
2k n+ 2k 2k 7 
“pen eee (a) - Booey 
RDO x n> 4: RDO 7 a 
a 
ay xX 1; 1 ( x, i 
=) pale ctaeeet Oe pemecacaeeeare! (fr (eee, (perme 
‘Sie 2 — = 2 
rE (i- xX) 1-xX aG—X) 
= 
“LEX” 
Combining the previous two paragraphs yields the desired result. o 
8.6 Notes 


The following people provided solutions for the problems in this chapter: 
Robin Chapman (problem 24), Prasad Chebolu (problem 14), Darij Grinberg 
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(problem 14), Xiangyi Huang (problems 18, 22), Mitchell Lee (problems 4, 10, 
12), Palmer Mebane (problem 11), Greg Martin (problem 2), Fedja Nazarov 
(problem 21), Fedor Petrov (problem 23), Richard Stong (problems 8, 12, 18), 
Qiaochu Yuan (problems 1, 6, 16), Victor Wang (problem 12), 


Addendum 8.A_Lagrange’s Inversion 
Theorem 


In many counting problems, we start by finding a recursive relation for 
the number of objects we are trying to count, then we consider the generating 
function of that sequence (or the exponential generating function, according to 
the context) and establish a functional equation for it, based on the recursive 
relation. For instance, when counting the number of ways to put parantheses 
in a product of n terms, the associated generating function turns out to satisfy 
the easy functional equation f(z) = 1+ zf(z)®. Of course, in this case one 
can solve this quadratic equation and find a formula for f(z), which in turn 
allows us to find the desired number of ways (which is the classical nth Catalan 
number). But what if the equation was f(z) = 14+zf(z)®? Then surely such a 
method would not work, simply because it is impossible to solve this equation 
using radicals. Of course, one might argue that such an equation is unlikely 
to come up in enumerative combinatorics, but this is also wrong! 

The purpose of this addendum is to present a basic and very powerful 
tool for dealing with such problems (and many more), the Lagrange inversion 
formula. After discussing the very beautiful proof of this result, we will turn 
to applications, which will hopefully show its power. 


8.A.1 Statement and proof of Lagrange’s inversion formula 


Before stating Lagrange’s inversion formula, let us recall a very basic 
result on compositional inverses. 


Proposition 8.A.1. Let K be a field and let F(T) = aT +-aoT?+--- € KUIT]] 
be a formal series with aj #0. Then there exists a unique f € K (IT such 
that F({(L)) = T and it also satisfies f(F(T)) =T. 


Proof. Let. us look for solutions of the equation F(f(T’)) = T having the form 
F(T) = iT + beT? + -+- (it is clear that if f is a solution, then f has zero 
constant term). By comparing the coefficient of T on both sides, we obtain 
by = z. Doing the same for the coefficient of T? yields ayby + agb? = 0 
and, in general, looking at the coefficient of T" we obtain an equation of the 
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form a@1bp + Gn(a1,@2,...,@n,b1,..-,B8n.1) = 0 for some polynomial G,. This 
already shows that if the solution exists, then it is unique, But it also shows 
that a solution exists, since the previous equations can be-solved recursively. 

In order to prove that f(#(1)) = T, observe that f has the same property 
as F (namely its constant term vanishes and its linear term is nonzero). So, 
by the previous Sane. a is a unique g such that f(g(T)) = T. But 
then F(T) = F(f(g(T))) = o(T) and so F = g and f(F(T)) = T. a) 


We are now ready to state and prove the main theoretical result of this 
addendum. The proof given here (due to Hofbauer) is an adaptation of an 
analytic proof that would work over complex numbers. However, using the 
purely algebraic notion of residue of a Laurent series, one can obtain a proof 
over any field of characteristic zero. Recall that if f € K[[T], then [T"1f(7) 
denotes the coefficient of T” in f. 


Theorem 8.A.2. (Lagrange’s inversion formula) Let K be a field of charac- 
teristic 0 and let e be a series in K|[T]| whose constant term is nonzero. If 
FE K|T]] satisfies f(T) = Te(f(T)), then for any g € K{[TI] and any n>0 
we have 


("la F(T)) = AE NEy' TelT)") 

Proof. Let F(T) = ,80 F(T) = a-T+--- for some a € K*, as the constant 
term of e is nonzer 0. om he hypothesis f(T’) = Te(f(T)) becomes F(f(T)) = T. 
Thus f is the (compositional) inverse of F and so f(F(T)) = T, too. If 
A(T) = Meg @nT™ € K{[T]|[1/T] is a Laurent series with coefficients in K 
(so ay = 0 for all n small enough), let res(hdT) = a_y. Obviously, for all h 
we have res(h'dT) = 0. The key point is the following “change of variable 
formula” 


Lemma 8.A.3. [fG € K([T]|[1/T], we have 
res(G(T)dT) = res(G(F(T))F'(T)dT). 


Proof. By linearity, we may assume that G(P) = T*, with k € Z. If k 4-1, 
both G(T) and G(F(T))F'(T) are of the form 9 with g © KU[TA/T) and 
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so both terms of the equality we want to establish are zero. So, assume that 
k= —1, Then? 


FW) 1 e(7T) 1 (: une 


FD) Fat) TE 
80 res ( Fa) = res (%) and we are done again. Oo 
Joming back to the proof of the theorem and using the lemma, we can 
write 
(T"]g( F(L)) = res ( AL) ar ir) 
g( f(F(L))) 
= res (EO regal “F(D)dT 
1 
= ~Sres(o(T)(F(L)"Y'aT). 
As u'v tue! = (uw)', we have res(1! uv) = —res(ue") for all u,v, which combined 


with the previous equality yields 
fe i f 
[I"|g(4(T)) = res (Seer) 


set el FO) ayn 
= Tres (Gre a (P)"dT 


i. 
sr AP N' Delran). 
The result follows. im) 


Here is an easy application of the inversion fon mula, which is not so easy 
to prove by other means: 


“Note that 


pe) 252 
ee 2@) = fe _— 


exists in A(T}, 
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Example 8.A.4, Suppose that two sequences (@n)n,(bn)n of complex numbers 


satisfy ‘: 
bn = Ss a Pe 5 or 


k 
for all n > 0. Prove that 


functions of the two sequences. An easy computation shows that 


B(X) = Bg . (= (, 7 .)*) = deen De) (, A 


k 
k k 2 
=> XFS (S)x"=> a, X*(1 + X)¥ = A(X + X?), 
k “ na Us k 


Let Y = X+X?, so that X = f(Y) with f(Y)=Y- TERT: Using Lagrange’s 
inversion formula, we obtain 


ay = (¥"A(Y)) = [F"1(BUO) = YB) 


-tyyew~a+yy=T Beye #X)=") 
m k 


‘ kb, [ 1 


k 


and the result follows from the easily checked equality 
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8.A.2 Two variations and some applications 


In applications, one also encounters the following versions of the inversion 
formula. For instance, we will use the following result to prove a very nice 
combinatorial identity due to Abel. 


Theorem 8.4.5. Let K be a field of characteristic 0 and let e be a series in 
K{[T] whose constant term is nonzero. Then for all f¢ KUT] we have 


HO) = 40+ 3 (ae) Airmen. 


Re) 


Proof. Let X = X(T) be a formal series such that X = Te(X) (it exists by 
proposition 8.A.1 applied to F(T) = an): Then Lagrange’s inversion formula 
yields 


S(X(D)) = FO+ VII IS(X DT = f()+° “(DG (D)e(TY")-T". 
n=l not 
Now, substitute 7’ = Y/e(Y) to obtain the desired result. G 


Here is the promised application, which is really not easy to prove by 
direct computational means. 


Theorem 8.A.6. (Abel’s identity) For all comples numbers a.a,y and all 
positive integers n, 


n 

(a+ y)” = > Qn + ak) ly — akyP-k, 
kag \™ 

Proof. The desired equality is equivalent to 


n 


(ety) Sy x(@t ak)! (y ~ akye-* 
nt ap k(n By! 


k= 


Let us consider the generating functions of the two sides of the previous equal- 
ity, The generating function of the left-hand side is e@+#)T, The generating 
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function of the right-hand side is 


yy cee ak pm 2a + ak)" rot), 
n O<kEn kl (n —k)! 3 kt 


Thus, by dividing by e87, it suffices to prove that 
1 3 Trk 
FF we Le s Ree tak) le kT pk 
REI 
But. this is a consequence of theorem 8.A.5 with 
f(D) = e*" and eT) = eT, a) 
Let us apply this result to prove the following nice-looking identity, 
Example 8.A.7. Prove that for all n > 1 we have 
oz (ie IG +1 = a(n gaye 
Li20¢jen \* 


AMM E. 2828 


Proof. Take a = 1, © = 1 and y=7-+ 1 in the previous theorem. We obtain 
Ve atte j n 

S (een G +n = naa)", 

itjen ¢ 

Unfortunately, this is not really what we want to prove, but it shows that we 

are on the right track. To get rid of the extra 1 in the exponent of j +1, 


differentiate with respect to y the equality in the previous theorem and then 
takea = 1,2 = 1 andy=n+1. This time we end up with 


n ; 
n(n + 2)? i+ pili; Jd 
maayr= SS (Nernyu+y 
ttjen 
Now, observing that j = J +11, we can write the last sum as 


oa (een g+yy— 3 (Herneg ene, 


itjen ib jun 


Combining this with the first relation, the result follows. 
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Here is a quite exotic problem: suppose that ¢ ¢ K [[Z7] and consider the 
sequence a, = [T"}(e"(T)). What is the generating function of the sequence 
@,? The following result answers this question in a more general context: 


Theorem 8.A.8. Let X, Y be variables satisfying Y = Xe(Y), where e € 
K([T]] has nonzero constant term. Then for any F € K [[7]] we have 


me n n FY) 
ae (F(D)e(T)") XX" = T= xe) 


Proof. Apply theorem 8.4.5 to f(T) = : i Edu (recall that this is formal 


integration, so f(T’) is the unique formal series vanishing at 0 and such that 
f(uje(u) = F(u)). Using that f’ = E, we obtain 


£Y) = Arey Yrerecry) 


n>l 


=D dryer yx, 
nei 


Differentiating this equality with respect to Y we obtain 


FY) _ od sb) fppeph apap nel a 4 OX. 
ay) = ayl0) = DP yer mer yx, 


nei 


Finally, differentiating Y = Xe(Y) we obtain dY = dX - &¥)+Xe'(Y)dY, so 


Replacing this in the previous equality yields the desired equality. QO 


Example 8.4.9, Find a closed form for the generating function of the sequence 
a, where ay, is the constant term of (Q4+X4+ *)". 
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Proof. Note that a, = [Z"(e"(T)), where e(P) = T? +7 +1. So, by the 
previous theorem 


nm 
» anX" = 1-X rea Titay 
where Y = Xe(Y). Solving the equation in Y yields 


1~X—- 


y= 


and then an easy computation shows that 


(J 


So a,x" = Bema 
a Vi ~ 2X — 3X2 


8.A.3 Examples from enumerative combinatorics 


In this section we consider applications of the inversion formula in count- 
ing problems. We start with an absolutely classical and beautiful theorem of 
Cayley, but. we need a series of definitions before stating it. Recall that a tree 
is a connected graph with no cycle. A labeled tree on the set {1,2,...,n} is 
a tree whose set of vertices is {1,2,...,n}. A rooted tree is a tree in which 
one of the vertices (called the root) is distinguished. There is a unique (non- 
backtracking) path between any two vertices of a tree. The parent of a vertex 
in a rooted tree is the vertex connected to it on the unique path to the root. 
A child of a vertex v is a vertex whose parent is v. A tree is called ordered if 
one is given an ordering of the children of each vertex. 


Theorem 8.A.10. There are n™~? unordered labeled trees on the set 
{1250 5.5 Tu}. 


Proof. Let a, be the number of unordered rooted labeled trees on {1,2,..., 7}, 
with the natural convention that ap = 0. It is enough to prove that a, = n""1 
as clearly the number of unordered labeled trees is a,/n. 

The point is that giving such a tree is the same as giving its root and a 
forest of subtrees whose roots are the children of the root. Suppose that the 
root has & children and that the corresponding subtrees have ny, n2,-...% 
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vertices, so ny + o+++++ ng =n—1. The number of ways to distribute the 
n-~ 1 vertices different ‘oti the root in these k subtrees is 


ae eas _  {n-1)! 
ny ny ny! gl 


Once such a distribution is made, we have @n,Gng***@n, ways to label the 
elements of the forest and so the contribution to the total number is 


7Ony Ong *** Ong 


np! 


however each configuration is counted k! times (since we do not care about 
the order of the children of the root) and the main root can be chosen in n 
ways. We finally deduce the very complicated recurrence relation 


i 
mane DS 


* pate pRgon—L 


This simplifies drastically if we consider the exponential generating function 
T(X) = Davo an, since 


if o Te q On, & 
[XP xy SR Ste 
Aye pap aved myl--- mp! 


Hence the recurrence relation can also be written 


[xe] SS — = [XP eM, 
; { 


that is XeT(*) = T(X). Using Lagrange’s inversion formula, we obtain 


of = (MUT(X)) = Apex = 


from where the result follows. 


0 
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We end this addendum with two more difficult examples. The following 
beautiful result is taken from [65]. 


Example 8.A.11. An intransitive tree on the set of vertices’ {1, 2,..., 7} is a tree 
such that for all 1 <i <j <k <n, {i,j} and {j,k} are not simultaneously 
edges. Prove that the number of such trees is 


1 oy (,) yd 
ne grei aes: 
n-2 pea k 
Note that it is absolutely not clear that the above quantity is an integer! 


Proof. Let F, be the number of such trees and let 


= Se Fut oy 


n>o 


be an associated exponential generating function. Call a vertex i left if all of 
its neighbors are greater than ¢. Let Ly be the number of rooted intransitive 
trees on the set of vertices {1,2,...,n}, whose root is a left vertex. Then 
Ly, = 1 and we clearly have Ly, = $F, for n > 2 (n comes from the choice of 
the root, division by 2 takes into account the fact that the probability that 
the root is left is 1/2). If 
TE 
LT) = Ly— 
(T) » na 
is the exponential generating function associated to Ly, then Ly, = 5F, for 
n> 2 yields L(T) = Fa + F(L)). But the exponential formula implies that 
F(T) = e!7), as an intransitive tree on the set of vertices {1,2,...,n + 1} is 
obtained from a forest of left-rooted trees on {1,2,...,n}, by connecting n+1 
to each root. Thus, we obtain F(T) = erttFT) af f(T) = T+ F(T), 
we deduce that f(T) = T(1+e(/?), An application of Lagrange’s inversion 
formula yields 


Fu IT") f(T) = Aireyc etn = Lire 3 (i) AT /2. 


(n— WV! roy 
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If we expand e*?/? and collect terms according to the exponent of T, we finally 
deduce that « 

1 n 1 = 

a= Fogo 2. (i) A 


Finally, a question by James Propp with a nice proof from [69]. 
Ezample 8.4.12, The vertices of a polygon P with N + 2 vertices are labeled 
1,2,1,2,... in order (stopping when the end is reached). Let ay be the number 
of triangulations of P with no monochromatic triangle. Then ay = 527 (*") 


In tT 
if N = 2n and ay = SS (3) if N =2n+1. 


Proof. Define aj = 1. Suppose that N = 2n+1 and call a triangulation proper 
if it contains no monochromatic triangle. Consider a proper triangulation 7 
of P. Note that P has an edge labeled 1,1. This edge must be a side of a 
triangle with a vertex labeled 2. If this vertex is the ith vertex labeled 2, 
with 7 2 0, then the two sides of the triangle split P into a 2i + 2-gon and a 
2n ~ 2% + 2-gon and both of these polygons are properly triangulated by 7. By 
adding over all i we obtain Bone —_ = Veo 02;@2n—2; and a similar argument. 
for N even yields az, = > pire 0  azaan_1-i. This suggests considering the two 
generating functions 


A(X) = ST aanX", B(X) = So aang X". 


n>h n>0 
Then the previous relations can be written in a compact form 
A(X) = 2X(1+ A(X))B(X), BCX) = (1+ ACX))?. 
Indeed, we have 
n 
B(X) = S7 amy X" = > bs sata) x” 


nzo n>0 \i=0 
2 


So aX? ] = (1+ A(X)? 


0 
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and 


‘Qn antl 
A025 (Sawer) area Z ($e wears) x 


n>i nzd Vis 
=X- > (= QA 27 mn Bi. n) XX" 4X. Ss (= QQi4-1 dn +) x” 
na>o \i=t nO \i=d 


= 2X(1+ A(X))B(X). 


We deduce that A(X) = 2X(1+.A(X))? and an easy application of Lagrange’s 
inversion formula finishes the proof, 


8.A.4 Composition of generating functions 


One of the key points in the proof of Cayley’s theorem 8.4.10 is to estab- 
lish the functional equation Xe?) = T(X) for the exponential generating 
function of the number of labeled trees. We would like to give a more ab- 
stract and general context for this kind of argument, which appears very often 
in counting problems. We follow rather closely the wonderful book [76) and 
we strongly advise the reader to take a look at the first chapter of it, which 
contains an impressive number of examples and problems on this topic. 

Let K be a field of characteristic De and let f,g: N -+ K be two sequences 
of elements of K. Let Ey = Sy>9 f(n)*> be the generating function of f. We 
would like to give a combinatorial interpretation of the generating functions 
E;- Ey and Eyo Ey. Note that Ey - Hy = E;,, where 

n 
in) = 90 (2) Fate ~ &) 
k=O 
We deduce that for any finite set X we have 
A(IX|) = S$ FUSHo(IX —$})= $2 FUSi!)g(|S2I), 
SCX (81,82) 


the second sum being taken over all ordered partitions (51,2) of X (and S;, 
Sg may be empty). By an obvious induction, we deduce that 


Ep Epos Ey, = En, 
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where 
AUX) =  S™  fiSi)) - fa(iSal) + - Fe(LSs)) 
($1 ,S2,..-)54) 

the sum being taken again over ordered partitions of X, For instance, con- 
sider the problem of counting the number of partitions (Sy, So,.-.,5%) of 
{1,2,...,2}, where each S; is nonempty. By taking f(n) =1ifn > 1 and 
FO) = 0, we deduce that the exponential generating function for the number 
of such partitions is 


EHX) = (e - k= 1) “(f) 


j=0 


and expanding ¢* yields the desired number of partitions. 
Suppose now that f (0) = = 0. We would like to understand the generating 
function Ey o Ey. 


Theorem 8.A.13. Let f.g: N + K be sequences such that £(2) = 0 and 
g(0) = 1, Then E,o Es = En, whereh: N+ K is a sequence such that 
h(0) = 1 and for all finite sets X we have 


ACIX|) = S” o(k) - FUS1\) « F(Sel) «---- FOSel), 


the sum being taken over all unordered partitions (S;,S2,..., Sg) (with arbi- 
trary k) of X into nonempty subsets. 


Proof. We clearly have h = = RDO g{k)hy, where 
g(r) = 59 PUSil) ~~ £OSi)), 
wT 


the sum being taken over suordered partitions with & classes. Hence it is 


enough to prove that Ej, = a8 But this follows from the previous discussion 
and the fact that we are only considering unordered partitions here (thus the & 
classes may be permuted in k! ways and yield the same unordered partition). ©] 
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Example 8.A.14. Let us consider again the problem of finding the functional 
equation for the exponential generating function of the number of unordered 
rooted trees on {1,2,...,n}. Let T be this generating function. Then by the 
previous theorem Xe™(*) is the generating function for the number of pairs 
(r, F), where r is a root and F is a forest of unordered rooted trees starting 
from this root. But it is clear that any unordered rooted tree arises in this 
way, so we actually have XeT(*) = T(X), 


Example 8.A.15. Suppose that we want to count rooted unordered labeled 
trees such that the number of children of each node is in a fixed set S, con- 
taining 0. The argument used in the previous example yields the functional 
equation 


jxjax AoE 
ses . 


Using Lagrange’s inversion formula, we obtain a formula for the number of 
such trees, 


Example 8.4.16. Let E be the generating function for the number of connected 
graphs with vertices 1,2,...,n. Giving a graph with vertices 1,2,...,n is the 
same as giving a family of disjoint connected graphs (its connected compo- 
nents), thus by theorem 8.A.13 the generating function for the graphs with 
vertices 1,2,...,n is e®. But since there are 22) such graphs, we deduce that 


Example 8.A.17. Consider two sequences f,g such that f(0) = 0, g(0) =1 
and define h(0} = 1 and ‘. 


AUX) = ST gk) FC) FUCal) + PCR, 


weSyin(X) 


where Sym(X) is the set of permutations of X and Ci,C2,... ,Cy are the 
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cycles of c. Then theorem 8.4.13 yields? 


ERX) = By | Sof) 


nei 


Example 8.4.18. For nonnegative integers c,¢2,..., let Gn{C1,€2,..-) be the 
number of permutations o € S, having ¢ cycles of length i for all i <n. 
Consider indeterminates X,,X2,.... Then the previous example yields the 
following cycle-index formula 


: 7 tT 
Gn {C1 ,€2,...) “XP! XSF ++ P Oa ar = OP So Xn: iis 


en 20 


Example 8.4.19. Let us count the number of permutations of odd order, ie. 
for which all cycles have odd length. By taking X; = 1 when 7 is odd and 0 
otherwise, we deduce that the exponential generating function for this counting 
problem is 


nazi 
nm odd 


aries (“ +1) ~ log( =?) 


‘Note that we also have 
AUX) = D7 a(R) £151) (Si] ~ 1)! FS 1) S| ~ 1), 
the sum being taken over unordered partitions 7 of X, since the cycle decomposition of a. 


permutation yields a partition of X and since one can cyclically permute in (jS;} ~ 1)! ways 
the elements of a. class with |S;| elements. 
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and an easy application of the binomial formula yields the mumber of permuta- 
tions f(n) = (1-3---+ (n—1))? when n is even and f(n) = (1-3-.---(n—2))?-n 
when 2 is odd. : 

Example 8.A.20. Let k be a positive integer and let f(n) be the number of 
permutations ¢ € S, such that o* = 1, This is equivalent to the fact that the 
length of each eycle of o divides k. Thus by the previous examples 


8.A.5 More tree-counting problems 


In this section we present another proof of Cayley’s theorem, as well as 
some similar counting problems, all related to trees. The following general 
result is quite useful in problems concerning trees, 


Theorem 8.A.21. Let dj,dg,....d, be positive integers such that dy + do + 
tot dy = 2n—2. Then the number of trees on the set {v1,¥2,...,Un} such 
that verter v; has degree d; is 


Proof. We will prove the result by induction on n. We may assume that 
d,, = 1, by permuting the d,;'s if necessary. Consider a tree on {v1,v2,..., tn} 
such that deg(v;) = d; and remove vertex v, and the unique edge whose 
endpoint is u,. We obtain a tree on {v1,v2,...,U,.1} whose degrees are 
dy,...,dj-1,d; — 1,dj21,...,dn-1 if um is connected to vj. Conversely, any 
such tree on {vj,--.,Un—1} yields a tree on {vy,...,v,} simply by connecting 
vp with vj. It follows that there are 


n 


at 
jm] 


. 


YO Tide 1)! 


such trees and the result follows. 
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Cayley’s theorem is a direct consequence of the previous theorem and of 
the multinomial formula: the number of trees on {1,2,...,n} is 


— (n~ 2)! As (n 2)! 
2 Gop = CU pore te 
bee “fagoin-a ( one (dn ~ 1) in-teHigen—g Ht ta! 


= (Lt bt. 41)? = nt, 


Proposition 8.4.22. There are Ga) ‘(n—1)""*"! labeled trees with vertices 
1,2,...,” in which verter 1 has degree k. 


Proof. This is also an easy consequence of the previous theorem. The desired 
number of trees is 


vo t= 2)! a G > ") + (m—~1)P7e} 
—D!- (dy ~ 1)l-..e- ~ 1! a " , 

Bde tdentin a} (k ~1)!- (dy — 1)! (dy ~ 1)! k-1 
the second equality being a consequence of the multinomial formula. ob 


Let. us introduce a very useful notion in graph theory. 


Definition 8.4.23. Let G be a loopless graph. A spanning forest is a sub- 
graph without cycles and haying the same vertices as G. A spanning tree is a 
connected spanning forest. 


_ Here is a nice application of Abel’s identity. 


Example 8.4.24. There are (n — 2). n?-3 spanning trees of K,, which do not 
contain a fixed edge of K,. 


Proof. Call 1, 2,...,n the vertices of K;, and assume without loss of generality 
that the fixed edge is e = 12. Let f(n) be the number of spanning trees that 
contain e. Such a tree appears uniquely as a result of the following process: 
consider two trees T), 72 whose vertices form a partition of 1,2,...,” with 1 
a vertex of T; and 2 a vertex of T2, and then join these two trees by the edge 
e. If Ty has & vertices different from 1, these vertices can be chosen in (",") 
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ways. Once these vertices are chosen, we have (k + 1)*7! possibilities for Ty 
and (n — k — 1)"~*>? possibilities for T>. Thus 


nd ‘ 
fm= >> & * *) (kK+ DP? mk 1yrrk$ = 2 nt, 
k=O 


the last equality being an easy consequence of example 8.4.7. The result 
ma 
follows. o 


Remark 8.4.25. Here is another approach, suggested by Richard Stong. Let 
X,, be the probability that a randomly chosen spanning subtree of K,, contains 
the edge e. Then by symmetry it is clear that E[X,] is the same for all edges 
e. Since any spanning subtree of K, has exactly n — 1 edges we have 


SO EX) =n 1. 


Therefore since all (3) terms in this sum are equal, 


Hence by Cayley’s formula, there are 2-n"~* spanning subtrees containing e 
and (n ~ 2)n"~? that do not contain e. 


Chapter 9 


A Little Introduction to 
Algebraic Number Theory 


This rather long chapter is concerned with elementary algebraic number 
theory. The techniques are rather diverse: basic linear algebra, algebraic num- 
bers and symmetric polynomials, cyclotomy and p-adic analysis are some of 
the topics discussed in this chapter. Since we will use the notion of algebraic 
number quite often in this chapter, we end this introduction with a few rec- 
ollections. For more details and some proofs, the reader is referred to the 
addendwn 9.B. 

A complex number z is called algebraic if it is root of some nonzero poly- 
nomial with rational coefficients. In this case, there exists a unique monic poly- 
nomial with rational coefficients, called the minimal polynomial of z, which 
vanishes at z and has minimal degree. The roots of this polynomial are called 
the conjugates of z. The crucial property of the minimal polynomial is that 
it is irreducible over Q and divides any polynomial with rational coefficients 
that vanishes at z. A fundamental theorem in algebraic number theory states 
that the algebraic numbers form an algebraically closed subfield of Q, thus an 
algebraic closure of Q. If z is an algebraic number, we let Q(z) (or Q[z}) be 
the subfield of C generated by z. It consists of all numbers of the form f(z), 
with f € Q[X] (or equivalently f € Q(X)). This is a finite extension of Q, 
of degree equal to the degree of the minimal polynomial of z. The primitive 
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element theorem ensures that all finite extensions of Q are of the form Q(z) 
for some algebraic number z. We call such extensions number fields. We will 
frequently use the notation [L : K] to denote the dimension of L as K-vector 
space, as well as the fundamental tower relation [M : K] = [M: L)-|b: K] 
for any finite extensions M/L/K. 

A more refined notion is that of algebraic integer. This is a complex 
number that is killed by some monic polynomial with integer coefficients, By 
Gauss’ lemma, we can characterize algebraic integers as those algebraic num- 
bers whose minimal polynomial has integer coefficients. An easy but funda- 
mental result is that a rational number which is also an algebraic integer is 
necessarily a rational integer. Another important result is that the algebraic 
integers form a subring of the field of algebraic numbers. 


9.1 Tools from linear algebra 


In this section we consider a few applications of linear algebra to num- 
ber theory. These concern especially divisibility issues and linear diophantine 
equations. 


1. Let a,6,¢ be relatively prime nonzero integers. Prove that for any rela- 


tively prime integers w, v, w satisfying au+bu+cw = 0, there are integers 
m,n, p such that 


Octavian Stan&sild, Romanian TST 1989 
Proof. Consider the linear system in the variables m,n, p 
a=nw~ pe, 6=pu-mw, c= mu—nu. 


Trivially, the determinant of this system is 0 and the rank of its associated 
matrix is 2. It is thus enough to solve in integers the system @ = nw — pe, 


p such that vp = —a (mod w) and up = b (mod w). Now, the hypothesis 
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implies the existence of integers A,B,C such that Au + Bu + Cw = 1. We 
deduce that Aua+ Bua = a (mod w). Since au+ bu = 0 (mod w), we deduce 
that (Ab ~ Ba)v = ~a (mod w), so that we can take p= Ab— Ba to get 
up = b (mod w). Also, we can immediately check that 


u(Ab— Ba) = Aub-- Bua = b(Au + Bu) = b {mod w). 


Proof. We will actually prove a stronger result: for any integers a, 6, c and 
any integers u, v, w such that au + bv + cw = 0 and gcd (u,v, w) = 1, there 
exist integers A, B, C such that a = Bw—Cv, b= Cu—Aw ande = Av~ Bu. 
Indeed, since ged(u, v,w) = 1, a standard application of Bézout’s lemma 
yields the existence of integers X,Y, Z such that Xu + Yo+ Zw = 1. Let us 
define 
A=cY ~bZ, B=aaZ—cX, C=bX -aY. 


Then, 


Bw ~ Cu = (aZ ~cX)w ~ (6X ~aY)u 
= a(Xu+ Yu+ Zw) — X(au+ bu + ew) 
=a, 


Thus a = Bw — Cv and similarly b = Cu— Aw and c= Av — Bu. The resuit 
follows. CG 

A very nice and classical result is that [],<;. jen “3 is an integer for 
any Integers @),42,...,@,. There are many proofs of this result, at least two 
of them being presented in [3]. The following problem is a variation on this 
topic. 


2. Prove that for any integers a1, a9,... ;2m the number 


lem(a1,a2,--., a) 
args I] @-«) 


i<icjsn 


is an integer divisible by 1!2!---(n ~2)!. Moreover, we cannot replace 
L!2!--- (mm — 2)! by any other multiple of 1!2!-.- (n— 2). 
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Proof. Consider the matrix A = {ai3}i<ij<n with ay = GS) for j > 2 and 


ai3 = Feminggeatn) We will prove that 


det(A) = lem(ay,@2,..-,4n) Thicicjen (4s 
G1 G2 >-+ Oy 


12!--.(n— 2)! 


Since the entries of A are integers, it is clear that its determinant is an. integer, 
frorn which the first part of the problem will follow, 

Factoring an L = lem(a;,¢2,... +) out of the first column and multi- 
plying the i-th row by q;, it follows that 


ed ty a0 ba afm —2) - m(8)| 
£1 (7). (23 ie L 1 ag ag(ag—1) +++ a(® 


no 


a at : “i : 102 °° Gn : : , : 
OT) Ih ay talon 2) + an (222) 


Taking out the numbers Fi that appear at the denominators of the binomials 
in each column shows that 


La aay ant + 
Py ee iL - Lod, aj—ag +) able... 
O42 +++ Gy ll21.--(m— 2)! |: : : 
I @, G2 —an ably... 


Note that the (i,j) entry of this matrix can be written as P;(a:), where P;, 
0 <j < n—1 is a monic polynomial of degree 7. For each column j add 
a suitable linear combination of the previous columns to reduce the previous 
determinant to 


Lay at + a 

1 a a noe Gg 
(w= 2): : 
2 


ae 
Loan a, +. ak 


Gy +++ ay Zt. 


Since the last determinant is Vandermonde, the identity stated above is proved. 
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To see that the result is optimal, simply choose ay, =: (n!)? and a; = (nl)? +i 
for 1 <i<n. Then 


lem(q1, @g,....@,) = m!nay-+-an—4 
2 iL os 


because the numbers a, 4 = 1,2,...,0—1 are pairwise relatively prime. 
The result follows easily from t a 


“2 and the one in the previous prob- 


Remark 9.1. Quantities such as TT; <j 
lem have natural combinatorial interpretations: they are the dimensions of 
some irreducible representations of special unitary groups. Of course, explain- 
ing this is beyond the scope of this modest book, but the reader should know 


that these are not “just some random problems.” 


Remark 9.2. A similar result is proved in the beautiful paper [6]: if 
aq, @1,-..,4n are integers, then [],_;(a? ~ a3) is a multiple of Beare and 
this result.is optimal. This is also related to the dimension of some irreducible 


representations of the symplectic group. 


We continue with a nice application of linear-algebraic arguments. The 
ideas used in the following solutions are very useful in other contexts, too. 


y3: Let p be a prime and let a1,a9,..., @+1 be real numbers such that no 
matter how we eliminate one of them, the remaining numbers can be 
divided into at least two nonempty pairwise disjoint subsets each having 
the same arithmetic mean. Prove that a, = ag = --- = Ops. 


Marius Radulescu, Romanian TST 1994 


Proof. Subtracting from the a;’s their arithmetic mean (observe that the new 
numbers have the same property), we may assume that Ot dg++ + +Gpiy = 0. 
Fix some i <j < p+1 and let Ci,...,C, be the classes of a partition of 
{@1,..-,@p11} ~ {a;}, such that al Meec, = does not depend on [. Since the 
sur of the a;’s is zero and since 


SEs 


i recy 
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we deduce that we have a linear relation of the form 


1 1 
iol” shoe ie] eu - 


with |C;| <p. 

Now, consider all such linear relations, obtained by making j run over all 
1,2,...,p-+1. This gives us a linear system with p+ 1 equations and p+1 
unknowns (the a;’s), whose matrix has 7 on the main diagonal and numbers of 
the form } with k < p elsewhere. But then the determinant of the matrix will 
be of the form 52 for some rational m = 1 (mod p). Thus, the determinant 
is nonzero and since the system is homogeneous, the only sohution is the trivial 
one. This implies that all a;’s are zero and the conclusion follows. O 


Proof. First, we will reduce the problem to the case when all a; are integers. 
The following method is classical and very useful in a whole variety of situa- 
tions: consider the vector space spanned over Q by the a@,’s. This is a finite 
dimensional Q-vector space and if we take a basis of it and write each a; as 
a linéar combination with rational coefficients of the elements of the basis, 
we easily see that the coordinates of the a,’s also satisfy the conditions of 
the problem (because by definition the elements of the basis are linearly in- 
dependent over Q). Working coordinate by coordinate reduces therefore the 
problem to the case when all a; are rational. Multiplying all a;’s by N! for 
some sufficiently large N reduces then the problem to the case when all a; are 
integers. 

Assume now that all a; are integers and let us prove the result by induction 
on max |a;|. The base case is obvious, so let us focus on the inductive step. 
Removing every element a; gives sets Sig C {a1,@2,..-Ap41} ~ {a;} = A; non- 
empty, pairwise disjoint so that iS ee Sig F is independent of 7, say equal 
to k. Then 


1 
| Ss i aa ya 1Ssa] ps " 


2eUja1 Si3 
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Let |5;1| =m and observe that D541 15:7] =p —m and 


(p-m) bees =m a cs) => S*2=0 (mod p) 


reS 4 BEA S31 reA; 


Summing over all choices of i we obtain 


pit 
Sa =r (mod p) => a@=r (mod p)¥i€ {1,2,...,p+1} 
= 
Thus, we can write a; = pb; +r for some integers 6; But clearly 
{b1, b2,.-.,0pi1} also satisfy the conditions of the problem and moreover 
max [b;| < maxfa;|. By the inductive hypothesis all 6;’s are equal and so 
all a;’s are equal QO 


Proof. Here's a “no formula” proof, which uses the same kind of argument, 
but replaces the choice of a basis in a vector space with an approximation 
argument: first, we reduce to the case when all a; are integers in the follow- 
ing way. By Dirichlet’s approximation theorem there exists a large integer 
M such that all Ma; are very close to some integer A;. The linear equations 
deduced from the fact that the @,’s satisfy the conditions of the problem be- 
come approximate linear equations for the A;’s. But there are finitely many 
such equations and each has rational coefficients. Thus, if at the beginning we 
ensured that Ma; are sufficiently close to the 4;’s, the approximate equations 
in A; are actually exact. Thus the A;’s are integers satisfying the conditions 
of the problem. If we solved the problem over the integers, it follows that all 
Aj are equal. But then any two a,’s are less than 2/M apart and since M is 
arbitrarily large, this implies that all a; are equal. 

Now, let us assume that the a;’s are integers. If we remove one number, 
the common rational arithmetic mean for the rest of the numbers cannot have 
p in the denominator, so the sum of all other numbers is rational with p in 
the numerator and, thereby, an integer divisible by p. Hence all numbers have 
the same remainder modulo p as their sum. Now continue as in the end of the 


previous solution. 0 
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9.2 Cyclotomy 


. * dink : ahs 
There are y(n) primitive nth roots of unity, namely e“=”, where k is 
relatively prime to n. Hence the nth cyclotomic polynomial 


ink 
bn(X) = ] (X —e°®") 
IskSn 
god{&n)=1 


has degree a(n). The splitting field Qe) of @,, is called the nth cyclotomic 
extension of Q. These polynomials and their splitting fields play a very im- 
portant role in many areas of mathematics and gave rise to a whole series of 
very deep results. Their study would require a whole book by itself, so we 
decided to focus only on some very elementary and classical applications. 

Since any nth root of unity in C is primitive of order d for a unique din, 
we get the: 


Proposition 9.3. (Fundamental identity) We have 


XxX" —1=[[o(X). 
din 


This easily implies (by strong induction) that ¢,(X) € ZX] for all n. 
The following result is not trivial and plays an important role in many proofs 
concerning cyclotomic polynomials, We'll also see that a weak form of Dirich- 
let’s theorem follows very easily from it. For a proof of Dirichlet’s theorem in 
full generality, see addendum 7.A. 


Theorem 9.4. Let a be an integer and let p be a prime divisor of n{a)- Then 
either the order of a modulo p is n (and sop =1 (mod n)) or p ditides n. 


Proof. By assumption, p divides ¢,(a), and s: if @ has order k (mod p) then 
h\n. If k <n, then p divides both a* ~ 1 and ot (the second because of the 


fundamental identity and the fact that p divides ¢,(a)). As ged (at ~ 1,5 
divides % by the Euclidean algorithm, p|n and we’re done. 


Remark 9.5. Note that the proof also works for prime powers p. 
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Theorem 9.6. For alin there are infinitely many primes p=1 (mod n). 


Proof. For k > n large enough (it is actually enough to take k > 2) we have 
On{k!) > 1 and so we can choose some Prl@n(k!). Since (0) is 1 or ~1, we 
have én(k!) = 1,—1 (mod &!), which obviously implies that ged(p,,k!) = 1. 
As k > n we get pp > k > n and by the previous theorem we deduce tha: 
Pr = 1 (mod 7m). The result follows. 


For the next three problems we will use a very useful rationality result: if 
r and cos(r7) are both rational numbers, then cos(rm) € (1, =f O}. Let us 
recall the argument: 2cos(rm) = e* + e-** and the numbers e?™ e78™ are 
algebraic integers (they are roots of unity), so 2cos(rm) is an algebraic integer, 
Thus, if it is rational, it has to be a rational integer and the result follows. 
Before passing to the next problem, let us discuss a beautiful consequence of 
the previous observation. We will prove that the only regular n-gons all of 
whose vertices are lattice points are the squares, Indeed, let A, B,C be three 
consecutive vertices of the polygon and observe that 


i “og 4 2 AB? +. 2 y2\2 
i+ cos cos? 27 (AB? + BC AC*) EQ. 
2 n 4AB?. BC? 
Using the previous observation, the result follows easily, We strongly advise 


the reader to look for a geometric proof in order to appreciate the power of 
algebraic numbers! 


4. Let A,B,C be lattice points such that the angles of triangle ABC are 
rational multiples of 7. Prove that triangle ABC is right and isosceles. 


Proof. Note that any angle @ = ZABC with A, B, and C lattice points must 
have tan rational or infinity. To see this note that all lines between lattice 
points have rational or infinite slopes and if tana and tan # are rational (or 


infinite) then so is tan(a — 8) = (anes This implies that 


1 ~ cos 20 


tan? @ = sec? ~ 1 = 


is rational and hence cos 26 is rational. Combining this with the discussion 
preceding the problem shows that cos 2A, cos2B, cos2C are all equal to £1, 
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ahd and 0. It is immediate to check that the condition tan A, tan B, tanC 
tational or infinite says A,B,C must be integer multiples of #/4. Hence the 
only possibility is when ABC is right and isosceles. ' oO 


5. Let a be a rational number with 0 < a < 1 and 
cos(3ma) + 2cos(2ra) = 0. 


Prove that a = 2. 
IMO Shortlist 1991 


Proof. Let « = cosma and observe that the equation satisfied by a can be 
written as 


da? 4 4? 32-250 => (Qe + 1)(Qe*7+2-2)=0. 


OF course, if x = -~ i, we must have a = 3 and we are done. The difficult point 
is to prove that we cannot have 22° + x ~ 2 = 0. If this is the case, then # = 
many values as 2 runs over the positive integers. This will clearly contradict 
the hypothesis that a is rational. But since cos(2"7a@) = 2cos*(2"~!ra) — 1, 
it is easy to prove that we can write 


bn V1? a2 +1762 —8 
cos(2” am) = a, batt * Onda, Ong = rg 


The previous relations yield by induction that an, 6, are odd integers and that 
Gn41 > Gm. Thus cos(2"ra) takes infinitely many values. 


Remark 9.7, In general, let us choose relatively prime integers m,n with n > 2 
and find the degree of the algebraic number 2 = cos (2). Definez =e * a 
primitive n-th root of unity. The irreducibility of the cyclotomic polynomials 
(which is a very nontrivial theorem) implies that z has degree y(n) as an 
algebraic number. On the other hand, we have 


(Q(z) + FY = (Q(z) : QW@)] - [Q() : Q| 
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and we have [Q(z) : Q(z)| = 2. Indeed, 22 = z+ 27}, which implies that z 
satisfies a quadratic equation with coefficients in Q(x), so [Q(z) : Q(a)] < 2. 
On the other hand, we cannot have Q(z) = Q(z), because z is not a real 
number. Putting these observations together, we deduce that # has degree 
2) Using the previous result and the fact that cos (% — x) = sing, we can 
compute the degree of sin an The answer is a bit complicated: if n 3 4, 
the degree of sin oa is etn) if 8 divides n, 2 if ged(n,8) = 4 and ¢(n) if 
ged(n, 8) < 4. 


6. Prove that none of the numbers /n + 1 - ./n for positive integers n can 
be the written in the form 2cos (24) for some integers k,m. 


Chinese Olympiad 


Proof. First, we will find a polynomial with « = /n+1— Jn as a root, 
We have x? = 2n+1~2V/n?+n and so (a? ~ 2n — 1)? = 4n? + 4n, from 
where we easily find that 24 ~ 2(2n + 1)a?+1 = 0. Note that the other roots 
of the polynomial f(X) = X* — 2(2n +1)X?+1 area = +VnFi+ Vn. 
Next, we will find a polynomial with roots z = 2cos BES wet I, be the mth 
Chebyshev polynomial, defined by the equality Ti,(cos.x) = cos mz for all 7. 
Then T;, (§) = cos 2kw = 1. Thus the numbers 2cos 2&2 for k = 0,...,m—1 
are roots of g(X) = Tm (¥%)— 1. These m numbers are not distinct, but 
2cos Be = 2cos Bima ke for 1 < k < m/2 are double roots of this polynomial 
since g achieves a local maximum at these points. Thus these are the only 
roots of g(X). 

If Vn-+1 — Ji = 2cos “£2, then f(X) and g(X) have a common factor 
in ZX]. The only roots of f which lie in the interval [—2, 2] (which contains 
all roots of g) are /n+1—J/n and /n~ fn +1. Therefore this common 


n(n+1) is a square. But this would make 4n(n +1) and (2n+1)? consecutive 
positive squares, a contradiction. im) 


We continue with a very beautiful and classical result ({50]) concerning 
linear equations in roots of unity. 
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7. a) Suppose that a),e2,...,@, are rational numbers and Gis Ca. Ck 
are roots of unity such that a1¢) + a269 +--+ + axCp = 0. Moreover, 
suppose that $0. ai¢; # 0 for any proper subset J of cay peed 
Prove that (7? = Gj" for all i,j, where m is the product of p primes 
smailer than or equal to k. 

b) Let z be a complex number. Prove that there are at most 24” 7 hk. 
tuples (¢1,2,...,¢,) of roots of unity with the following property: 
there exist rational numbers a@1,a2,...,a, such that z= ee anG 
and z # ¥°,<; ai¢i for any proper subset I of {1,2,..., k}. 


Mann’s theorem 


Proof. a) We may assume that a1 = ¢ = 1. Let m be the least positive integer 
such that C7" = 1 for all i and choose a caine factor p of m. If m = pln with 
gcd(n,p) = 1, we will prove that 7 = 1 and p < k. This will imply that m 
divides Then? and the first part of the theorem will follow. Proving this is 
however not a simple task. 


We start with an observation: let 2 = € o and let ¢ be an m-th root of 
unity. We claim that there exists 0 = iB < pand x such that 2? = 1 and 


¢ = 2" -a. This is very easy: if ¢ = € a ; simply choose 0 <r < p such that 
rr = (mod p). 


cae this observation to each ¢, we can write G = 2"a4 with 2,7; 
as above. We have x, = 1 and r; = 0. The equation Sot ag; = 0 can be 
written )7P" b:2', where by = Dy,=1 ti. Note that as € Qe"), On the 
other hand, awe can compute the degree of z over Q(e 7 a), Indeed, observe 
that Q(z, a PY = = ae® m™ ), $0 that 


eM )(2) : (ey = CM)G _ elm) __o’) 
[Qle™"}(z) : Q(e*")] 


(Oe) :Q), elm/p)~— e(p=) 
and the last quantity is P ~ 1 for j = 1 and p otherwise. 

Note that Sore =O 5X" is not the zero polynomial, since otherwise we obtain 
the relation )7,... a4 = 0 for all 0 <1 <p. But the hypothesis yields then 
fan= = @ or {1,2,...,k} for all L. As ry = 0, this gives r; = 0 for all ¢ 
and so ¢,” = 1 for all i, contradicting the minimality of m. 
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If we combine the results of the previous two paragraphs, we see that we 
must have j = 1, as z is killed by the nonzero polynomial VY. yx! of degree 


at most p— 1. But then z has degree p ~ 1 over Qe) (as follows from the 
previous computation) and so 30?" OX’ ' is the minimal polynomial of z over 
Qe). As z is also killed by 14+ X +--+ X?~1, we deduce that these two 
polynomials differ by a constant. In particular, all 4 are nonzero. So for all 
0 <1 < pone can find ¢ such that 7; =. Clearly, this implies that p < k and 
the proof is finished. 

b) Fix a solution z = me 
solution, say z= Sey bj2z;. Thus 


k k 
SS aig - Se biz: = 0, 
im) =5 


but one has to be a little bit careful, as this relation does not necessarily satisfy 
the conditions of (a). However, if we fix 1 <i < k, we can find a minimal 
sub-relation of the previous relation which contains z,. By hypothesis, such a 
sub-relation must contain some ¢;. As the length of this sub-relation is at most, 
2k and as it clearly satisfies the hypothesis of (a), we deduce that 27” = a 
for all ¢; in this sub-relation. Here m = pcan p. So, for any i, 2; can take at 
most km. values and so the number of solutions of the equation in 21, z2,.... z% 
is at most (km)*. It remains to use Erdés’s famous inequality (theorem 3.A.3) 
Tlp<n p < 4" to conclude. oO 


,%G of the equation and consider another 


Remark 9.8. Let a1, a2, 
equation @121 +@g2z9+- 
(21, 22)--.)%n) of roots as unity which satlafies the eduntien anil cue ‘that 
Dies ui2i # O for any nonempty subset J of {1,2,...,.}. Conway and Jones 
[20] improved Mann’s theorem by proving that if a; € Q, then for any non- 
degenerate solution we have zf = he = +++ = 24 = 1 where d is the product of 
primes pi, p2,...,ps such that $77, (p; — 2) < n— 1. Also, in [30] the author 
proves using rather elementary and an beautiful arguments that there are 
at most (n + 1)%"+4)? non-degenerate solutions of the equation. 
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9.3 The gcd trick 


The division algorithm shows that if K C L are fields and if f,g @ K LX} 
are two polynomials, then their gcd is the same if we see f,g as polynomials 
with coefficients in K or with coefficients in L. That is, the greatest common 
divisor of two polynomials is not sensitive to the field in which the coefficients 
of these polynomials live. Combining this observation with Gauss’ Jemma, we 
also obtain that if f and g are monic polynomials with integer coefficients, 
then their ged computed in Q{X] has integer coefficients. This gives a very 
indirect, but sometimes very useful way to prove the rationality or integrality 
of a real number 2: it is enough to exhibit X —< as the ged of two polynomials 
with rational coefficients (respectively of two monic polynomials with integer 
coefficients). The next problems in this section illustrate this trick. 


8. Let a,b be two positive rational numbers such that for some n > 2 the 
number Ya+ V6 is rational. Prove that ¥/@ is also rational. 


Marius Cavachi, Gazeta Matematica 


Proof. Let us write */a+ Vb = c for some (positive) rational number ¢. Then 
¥a is a root of X” — a and also of (c~~ X)" —b, The key point is that it is the 
unique common root of these polynomials. Indeed, if z is a common root, then 
we can write z= %/az, and ¢- z= Ybzy for some nth roots of unity zy, 22. 
We deduce that a+ Vb = waz, + Whe. Since |zi| = 1, the real parts of 
21,29 are at most 1. Passing to real paris in the previous equality then implies 
that 2; = zz = 1 and the claim is proved. Now, since the two polynomials 
don’t have multiple roots, it follows that ged(X” ~ a,(e— X)" ~b) = X — va. 
The result follows now from the ged trick. 0 


9, Let m,n be relatively prime numbers and let x > 1 be a real number 
such that a7” + ik and 2” + a are integers. Prove that 2+ i is also an 
integer. 


Proof. Let a= x” + 4. and b = 2” +--+ and consider the polynomials 
x z y 


o(X) =X?" — aX" 41 = (X™ — 2 ™)(X™ — 2) 
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and 
aX) =X?" — BX" 4 1 (X" ~ oP )(X" — 2"), 


The crucial claim is that 
ged(p,q) =X? ~ (w@+a4)X +1 =(X -2)(X-27, 


Assuming this for a moment, we can conclude that 2 +27! is an integer by 
the ged trick. 

It remains to establish the claim and for that it is enough to prove that x 
and x~ are the only common zeros of p,q (since clearly p, ¢ have no multiple 
root). But if z is a common zero, we have 2” = 2 or 270 = z-” and similarly 
2 = a" or 2" = 2". We may assume (by changing z and z~) that z™ = 2, 
80 that |z| > 1. Then clearly we must have 2” = 2”. But then 2/x is a root of 
unity whose order divides both m and n. Since gcd(m,) = 1, it follows that 
z= 2 and we are done. 5 


The following problem is very similar to the previous problem, but a bit 
more difficult. 


10. Let @ € (0,2/2) be an angle such that cos@ is irrational. Suppose that 
cosk# and cos|(k + 1)6] are rational for some positive integer k. Prove 
that 6 = 7/6. 


USA TST 2007 


Proof. We will actually prove more: it is enough to replace k+-1 by any integer 
{ which is relatively prime to k. The key point is the following 


Lemma 9.9. If cosk@ and cosl@ are rational for relatively prime positive 
integers k,l, then either cos@ is rational or @ is a rational multiple of nr. 
Proof. If cosk@ = p and cosl@ = q, then e is a common root of the polyno- 
mials 

f(X) = X?*—pX* 44, o(X) = X% —agx' a1, 
On the other hand, it is not difficult to check that if @ is not a rational multiple 
of 7, then e# and e~® are the only common roots of f and g. Indeed, all roots 
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9 ast PLes ‘ < 
of f are ett for 0 <j <k and all roots of g are e#@+" withO <7 <1. 
On the other hand, since ged(k,/) = 1, the only solution of the equation 
eke  gtiOs FT iy g Si < k and 0 < jy < I (for some choices of 


Coming back to the proof, the previous lemma shows that @ is a rational 
multiple of 7. On the other hand, we saw in section 9.2 that the only rational 
numbers r € (0, 1] such that cosr7 is rational are r = 0,4, 4,3, 1. We deduce 
that k@ and /@ are integer multiples of 2. Since ged(k,) = 1, Bézout’s lemma 
implies that @ is an integer multiple of &. Since cos @ is irrational, we deduce 
that @ = %. a 


9.4 The theorem of symmetric polynomials 


The proof of the following result is quite elementary, but the result it- 
self is incredibly powerful and useful. If R is a commutative ting and if 
f € RiXi,X2,...,Xp] is a polynomial, we say that f is symmetric if for 
all permutations o of {1,2,...,n} we have 


P(X15.. 6) Xn) = F(Xeay, Xan) 


Recall that the fundamental symmetric polynomials are 


OK = ie Xi, Xig rr Xs 


ISt Sigs ip Sn 
for 1 <k <n. We have the equality 
(t+ Xi)(tE+ Xa)----- (€+ Xn) =e toy 8 + -- ton € REX... Xn). 


Theorem 9.10. (Fundamental theorem of symmetric polynomials.) Let R be 
a commutative ring and let f € R[X,..., Xy] be @ symmetric polynomial. 


Then there is g € R[X1,...,Xn] such that f(X1,....Xn) = g(o1,09,.-.,0n). 


9.4. The theorem of symmetric polynomials 4ii 


Proof. We will use induction on n and inside the induction step an induction 
on deg(f). For n = 1 everything is clear, so assume the result holds for n — 1, 
We now prove by induction on deg(f) the assertion of the theorem with n 
variables. If deg(f) = 0 or 1, everything is clear, It is clear that the polyno- 
mial g(X1,...;Xn-1) = f(X4,..., Xn~1,0) is still symmetric, so by (the first) 
induction it is a polynomial of the form AX ++ Xylay... Xs Xa) 
for some h € RX... :Xpn.1]. Note that the difference 


P(X, 665 Xn) AR tt Xn Xa Xp beet XY NXg Xa) 


vanishes when X, = 0 and is a symmetric polynomial. Therefore this poly- 
nomial is a multiple of X)-.-X,. Applying the inductive hypothesis to the 
quotient between this polynomial and X. 1+++Xy (which has degree less than 
deg f), the result follows. D 


Remark 9.11. It is not difficult to prove that the polynomial g is umique. 
This means that there are no algebraic relations between the polynomials 


Remark 9.12. The theorem also implies that any symmetric rational function 


fE R(X), X2,...,X,) is a rational function in the o;’s. Indeed, let 
o- P(X), Xa,...,Xy_) = P(Xo(1)s Xo(a)s +++ Xen) 


for Pé RIM, X,...,X,]. Then we can write 


fe Po Py 
@ Hees,7°@ 

for seme polynomials P,Q, P;. Since f is symmictric, so is Pi. 

The result follows from the theorem of symmetric polynomials applied to 
P, and to T],0-Q. 
Remark 9.13. We refer the reader to [66], chapter 5 for the proof of the 
following theorem of Lagrange: let K be a field of characteristic 0, If 
f € K(Xy,Xe,... :Xn), let Gy be the set (actually group) of those permu- 
tations o € S, such that f(X},X2,...,Xp,) = FXo0): Xo(2),-++sXo(ny)- IF 
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fig © K(X1, Xa,...,Xn) satisfy Gy C Gy, then one can find a rational func- 
tion A whose coefficients are symmetric polynomials in Xj, X2,...,Xp such 
that g = A(f). 

A very important consequence of theorem 9.10 is the following result, that 
will be constantly used in this section. 


Corollary 9.14. a) Let f € Q[X1, Xo,...,Xn] be a symmetric polyno- 
mial and let g € Q[X] be a polynomial of degree n, with complex roots 
24 22y.0+52n. Then f(21, 22,.--,%n) € Q. 


b) If f has integer coefficients and if g is monic with integer coefficients, 
then f (21, 22,-.-,2n) is an integer. 


Proof. Using theorem 9/10, we can write 
F(X1,---.Xn) = Alon, 02, -.-,On) 


for some h € Q[X1,...,Xn] (resp Z[Xi,...,Xn]). The result follows from 
the fact that o;(2z1, z2,..., Zn) are rational (respectively integers), because the 
coefficients of g are so. Q 


Another very useful result is the following generalization of Fermat's little 
theorem. 


Corollary 9.15. Let f € ZX] be a monic polynomial with complex roots 
21; 22)+++42n (multiplicities counted) and let p be a prime number. Then 


a4 hte + 22 = (zy 4+ 29+-++ +2)? (mod p). 


Proof. Corollary 9.14 implies that both sides are integers. Consider the quo- 
tient by p of the difference between the left-hand side and the right-hand side. 
Using the multinomial formula, it is easy to see that this quotient is a symmet- 
ric polynomial with integer coefficients in 21, z2,...,Z, thus the result follows 
from corollary 9.14. O 


Here is a nice application of the previous corollary. It was one of the 
difficult problems given in the Romanian IMO Team Selection Tests in 2004. 
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11. Let a,b,c be integers. Define the sequence (tr)n>0 by ao = 4, ay = 0, 
t= 2c, &3 = 8b and p43 = atno1 + brn + €Zn+41. Prove that for any 
prime p and any positive integer m, the number fpm is divisible by p. 

Calin Popescu, Romanian TST 2004 


Proof. Let ri,r2,73,r4 be the roots of the characteristic polynomial of the 
recurrence relation, namely X4 — eX? — bX —a. The crucial point and by far 
the hardest step in the proof is to realize that 


tarp try tre try 


for all n. This is suggested by ao = 4 and by the fact that problem creators 
tend to try to be sneaky.’ Proving the previous formula is immediate by 
induction, once we prove it for n = 0,1,2,3. For n = 0,1 this is trivial, for 
n = 2 follows from the identity 


2 
Sor? a (= ri) = 23> rir; =2c¢ 
i<j 
and for n = 3 we can use the recursive relation (since it is easy to see that 
Yn = TY + ry + rf +r} together with y_, == —b/a satisfies the same recursive 
relation as z,). With this closed formula for the general term of the sequence, 
we need to prove that }\r?” is a multiple of p. Since Sori = 0, the result 
follows from corollary 9.15 and by induction on m. im 


Let us consider now a few more or less direct applications of theorem 9.10 
and of corollary 9.14, 


12. a) Let P,R be polynomials with rational coefficients such that. P #0. 
Prove that there exists a non-zero polynomial Q € Q[X] such that 
P(X)IQ(R(X)) 

b) Let P, R be polynomials with integer coefficients and suppose that 
P is monic. Prove that there exists a monic polynomial Q € Z[X] 
such that P(X)|Q(R(X)) 

Iranian Olympiad 2006 
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Proof. The idea is very natural: the first condition that should be satisfied in 
order to have P(X MQ(R(X)) is that for each root z of P we have Q(R(z)) = 0. 


Therefore, if a, 22,.-.,%p are the roots of P (some of the x;'s may be equal), 
then we would like to have Q(R(x)) = 0. The most natural choice is to take 
n 
Q(X) = T](X ~ R(x,)). 
isl 


Note that it satisfies P(X)IQ(R(X)), because X — % divides R(X) ~ R(x,) 
for all 4, It remains to check that @ has rational (respectively integer, for the 
second part of the problem) coefficients, This follows from corollary 9.14. 0 


y 13. a) Let a1,02,... ain, By, by, ++ :'n € € be such that 


fi(X)=[]X-a), pcxy= nes ~ bi) € Z[X]. 
t=1 i=l 


Suppose that there exist 91,92 © ZX] such that figs + fag. = 1. 
Prove that: 


| | 
| 
= i. 


Tl TIt@-»,) 


jiml jal 


b) If a. are integers and 


mon 


i Ta ~5) 


je list 


1, 


prove that there exist. polynomials 91,92 € ZLX] such that fig, + 
fog = 1. 


Tbero-American Olympiad 


Proof. a) Note that the relation to be proved can also be written as 


fle = 1. 
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Evaluating the relation fig, + foge = 1 at a; yields f2(a;)92(a;) = 1. Thus 


[[p@)!- TT oe) 


it ii=1 


=1. 


On the other hand, TJ, fe(a:) and TEE; g2(ai) are integers, by corollary 9.14. 
These two observations are enough to conclude. 

b) Note that |e; — ;| = 1 for all i, 7, because a;,6; are integers. It is then 
immediate that we have only two possible cases: 

1) A, B are singletons of the form {a}, {a +1} or {a+ 1}, {a}. 

2) A= {a} and B = {a—1,a +1} for some integer a or B = {a} and 
A= {a-l,a+t}. 

Thus, by symmetry in A and B and by making a translation of the variable 
X + X —a, it is enough to consider the cases when A = {0}, B = {1} and 
A= {0},B = {-1,1}. In each case f; divides some X" and fo divides some 
(X?—1)™. Thus f; divides X?* and fy divides (X™* ~ 1)" for k,n sufficiently 
large. It is thus enough to find Bézout relations with integral coefficients for 
the polynomials X?* and (x oe 1)”. But this is immediate. oO 


Remark 9.16. The assumption that a; and ® are integers is useless. Here is a 
proof, due to Richard Stong. We advise the reader not familiar with the notion 
of resultant to read the discussion before problem 27 in chapter 12. It is not 
too difficult to check that the resultant of fy and fo is THE jet (b;-a;) = £1. 
But then the map 


aa ZLX deg fo)—1 x ZLX laegtfi)~1 aa ZLX dog f1)+des(fa)—2 
defined by y(g1, 92) = g1(X)fi(X) + go{X)fo(X) is invertible, thus we can 
find g:, gz € ZX] such that fig: + fog = 1. 
A classical problem is to prove that 
i 


ja + bV2| > Vatlal + lop 


for any integers a,b, not both of them equal to 0. The idea is that it is not 
clear how to deal with |a + 6/2] directly, but it is very easy to say something 
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about the product of this number and its conjugate ja — bV2|. Indeed, this 
product is a nonzero integer, thus at least 1. The result follows immediately. 
With a similar idea, it is not difficult to prove the following:absolutely classical 
theorem of Liouville: if z is an algebraic irrational number of degree d, then 
there exists ¢ > 0 such that for all integers p and g we have |z —- a > | > iat 
That is, irrational algebraic numbers are badly approximable with rational 
numbers. A much deeper result, for which Roth won the Fields medal, is that 
we can improve the previous inequality to |: _ 2| >= & for alle > 0. The 
following problems use this trick of multiplying by conjugates and estimating 
the conjugates, but they are much more challenging than the very simple 
example discussed above. 


V14. Let k,n be positive integers and let P(X) be a polynomial of ti n 
with all coefficients in the set {—1,0,1}. Suppor that (X ~ 1)4|PCX) 
and that there exists a prime q such that ;4 3 <I me ay: Prove that the 
primitive complex roots of unity of order g are roots of P. 


IMC 2001 


Proof. The problem looks rather complicated because of the strange inequality 
imposed on g. Let us forget first about that and ur the product of 
all values of P at the primitive gth roots of unity, Wi } P(2;). This is an 
integer, by corollary 9.14. If it is not 0, then []?7; | |P(z:)} = 1. However, by 
assumption there exists a polynomial Q iscesary with integer coefficients) 
such that P(X) = (1 ~ X)*Q(X). Therefore 


q-l 
II Plz, Tle — x)*- Tle 2). 
=] I é=1 
Since 
q-i 
XT KTP X41 = TY (X - 2), 


i=} 
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we have [[#7} (1 — %) = q and so 


q-l 


q-k 
TP) = 4 TT 
t=] tel 


But the same argument as before shows that chan Q(z) is a nonzero integer, 
therefore i |\P(zi)| 2 a. This is however impossible, since by assumption 
we have |P(z;)! <n +1, therefore 


g~l 
Ul [Plz < (n+ 9 < g* 
é=1 
The previous arguments show that P vanishes at one of the primitive roots 
of unity of order g. But since the polynomial 1+ X +----+X%-! is irreducible 
over the rational numbers, if P vanishes at a primitive root of unity of order 
q, then it also vanishes at all the other roots. This ends the proof. Cl 


One needs some gymnastics if one wants to avoid the use of Galois theory 
for the following problem. 


15. Let p be a prime and let m1,n9,...,nx be integers. Define 


= 
Prove that either $= 0 or S > k(t)? 
Holden Lee 


Zin 
Proof. Let z =e. The crucial ingredient in the proof is the following: 


Lemma 9.17. The number 


is an integer. Moreover, N = 0 if and only if S = 0. 
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Let us admit this for a moment and see how we can finish the proof. 
Assume that S 40. Then [N| > 1. So 


and the conclusion follows. 
Now, let us prove the lemma. As is well-known (and easily proved by 
induction) there are polynomials Fj; € Z[X] of degree j such that X7+X~S = 
Eo 
F(X + X71), Let F = S95, Fay. Then N= [U4 F(z! +274). The lemma 
will be proved if we prove the following result: 


Lemma 9.18. The minimal polynomial of z+ 2~ is 


7. 
f(X) = Te ~(2427) = Foci + Foss $00 
Proof. Note that 


Fy (X-AK 2) 
X+X)=]] 7 
ta] 


Thus f = Pot +Fy,-3 +--+ by definition of the polynomials F;. In particular, 
2 : 

f has integer coefficients and so N is an integer (by the fundamental theorem of 

symmetric polynomials). Moreover, f has degree ash and vanishes at z+27). 


But 
_ (1): p= 
(Q(z) :Q@t24jp* 2° 


so f must be the minimal polynomial of z+ 27, Oo 
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Now, assume that N = 0. Thus, there exists 1 < 1 < at such that 
F(z! +27!) = 0. By the lemma, F is a multiple of f and so F vanishes at 
z+271, But this means that S = 0, a contradiction. Thus, we have proved 
the crucial claim and the result follows. Oo 


The following result is certainly classical, but it is rather difficult to find an 
elementary proof in the literature. We follow one of the approaches proposed 
in the beautiful article {10} 


W116. Let ai,a2,...,4n be positive rational numbers and let ky, ko,.-.,kn be 


integers greater than 1. If al! fs tai! Ra, tat ko is a rational number, 


then any term of the previous sum is also a rational number. 


Proof. It is clearly enough to prove the following result: let k > 1 and suppose 
that the positive rational numbers @1,...,0n,61,..., ba satisfy 


ay 4/51 +s + an Yon EQ. 


Then 4/6; € Q for all i. 
Let ; 
A; = {roots of X* ~ abby} = {wia; 4/8; |1 <7 < k}, 


where w is a primitive root of order k of 1. Also, let 


and 


By theorem 9.14 we have P € Q[X]. Note that P(a: 4/1) = 0. Let d be the 
least positive divisor of k for which i/o € Q (it exists, as to € Q). If we 
manage to prove that d = 1, it will follow that {/b; € Q, so we can delete the 
first term of S and conclude by induction on n. So, let us prove that d= 1. 
By definition, we can write a; /b; = ¢/& with x € Q.. The crucial fact is the 
following: 


420 Chapter 9. A Little Introduction to Algebraic Number Theory 


Lemma 9.19. X¢— 2 is irreducible in Q(X]. 

Proof. If F is a monic polynomial with rational coefficients, of degree between 
land d—1 that divides X* — 2, all roots of F have absolute value {/z and so 
1F(0)| = (4/2) is a rational number, that is v poet) € Q, contradicting 
the minimality of d. o 


Since P( Vz) = 0, the previous lemma yields X¢ ~ x | P in Q[X]. Thus, 
if z is a primitive root of order d of 1, we have P(z~/x) = 0 and so there are 
(22,...;2n) € Ag x Ag X--- x An with S—2Vk = 29 +-+-+2py. If d > 2, then 
Re(z) < 1, so 


Re(S$) = $ 
= Re(z Vr + tg +++: + 2n) 


< Re(z ¥e) +S Jail 
ime 


nm 
= Re(z x) + Ss a; ¥/ bj 
i=2 


< Va+ Sai Voi 
t=2 
= 8, 
a contradiction. So d = 1 and a; /b; € Q. a 


9.5 Ideal theory and local methods 


We strongly advise the reader not familiar with algebraic number theory 
to read the appendices on number fields and p-adic numbers before reading 
this section, which is short but rather challenging. 

We start with a beautiful result of Polya concerning linear recurrence 
sequences. Recall that a sequence (¢,), Called a linear recurrence sequence 
if one can find d and 2,...,%q¢ such thabans¢ + idnig-1 +++: + tadn = 0 
for all n. 
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X 17. Suppose that (@,},>1 is a linear recurrence sequence of integers such 
that n divides a,, for all positive integers n. Prove that (4) is also a 
linear recurrence sequence. 


Polya 


Proof, By the general theory of linear recurrence sequences, we can find dis- 
tinct nonzero algebraic numbers 21, 22,...,2m and polynomials fi, fo,..., fm 
with algebraic coefficients such that 


Gn = film)zp + folr)2g +--++ fnl(r)en 


for all n. We will prove that if nla, for all n, then f,(0) = 0 for all i, from 
which the result follows easily. 

Let K be the field obtained by adjoining to Q all z;’s and all coefficients 
of the polynomials f;. Choose a prime p which does not divide any of the 
norms of the (nonzero) coefficients of f;’s or the norms of one of the z;’s. All 
sufficiently large primes satisfy this property. Fix such a prime p and consider 
a prime ideal J of K over p, with norm N(J) = p!. Impose the condition that 
gps divides aps. Note that 


m 


apt = S- fila} {mod J), 
i=1 


: J 4 
since 2? =z; (mod J) and since p € J. Thus, we must have 


for all 7 = 0,1,-..,m— 1. Seeing this as a linear system in the f;(0)’s, it 
follows that f,(0) € I for all i, unless I divides the determinant of the matrix 
associated to this system. However, this is a Vandermonde determinant in 
the 2;‘s and so, if we ensure that J and Thigs (2 — z;) are relatively prime, we 
will be able to conclude that f;(0) € J. But to ensure the last property, it is 
enough to choose a prime p which does not divide the norm of the algebraic 
number Th .j(#i — 2j)- Again, all sufficiently large primes have this property. 
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The previous paragraph shows that we can find infinitely many primes p 
and for each such prime an ideal J over p such that f;(0) € J for all 4. But then 
P will divide the norm of f;(0) for infinitely many primes ‘p and so fA0) = 0 
for all i. Tt is then clear that a is still a linearly recurrence deciienoe. 


We present two approaches for the following challenging problem: a rather 
exotic elementary one and a more advanced approach which uses standard 
facts about number fields and their p-adic completions. 


— 
Y 18. Let ars -++)@m be complex numbers such that af + af*-+-- shay is an 
integer for all positive integers m. Prove that (X — a1)(X 
sm. E ; = age t(X — 
ay) € Z{X]. os ss 
bec ak ~ 
Gholiak Pode Nike 


Proof. Let 
on = > 


ISt1 <ig< <ig sn 


Michael Larsen, AMM E 2993 


Qi, Mig °° Gig 


and P, = ak + ak +... + ak, 5 °s identiti i 
5 a®. so Newton's identities? 2 
ed Me can be written (for 


Py 0 Phy + 9Pe2 —--: + (—1)* kag = 0. 


ie pun erases ann pipes ars if P, € Z for all k, then one 
ek Sk< : i)(X —ag)---(X ~ an) € 4Z[X] 
and so n!- a; are algebraic integers. Observe that if },49,-..,0n satisfy the 
conditions of the problem, then so do aj,a),...,a% for all r >L We deduce 
that nila} .is an algebraic integer for all +. The next lemma shows ‘that all 
4,’s are algebraic integers, so the coefficients of (X —- a,)(X — ag)+++(X — an) 
are algebraic integers. Since these coefficients are rational numbers (this ae 
already been established), they must be integers and the result follows. 


Lemma 9.20. Let n be a positive integer and a be an algebraic number. If 


na® is an algebraic integer for all positive integers k, then a is an algebraic 
integer. ; 


?See the remark 9.22 for a proof of these. 
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Proof. If d; is the degree of the algebraic number a’, it is clear that dye > doksi 
(because Q(a"*’) © Q(a?*}). Thus there exists an integer j and a positive 
integer d such that da. = d for all k 2 j. Let a, = a?’,az,...,a¢ denote 
the conjugates of a” and choose a positive integer ¢ such that fy = c(X ~ 
ay)+++(X ~ ag) € Z{X] is primitive. Then go = c(X + a1) ++ (X + aa) € ZX] 
is also primitive, so by an easy application of Gauss’ lemma fi = A(X - 
az)---(X = a2) € Z{X] is also primitive. Since a? has degree d and since 
deg f; = d, it follows that f; is irreducible over Q. Repeating this argument, we 
obtain that f, = c?"(X —a2”).-.(X - 43") € Z[X] is primitive and irreducible. 
Next, since na?’ is an algebraic integer, we have hy = n*(X — a2")---(X = 
at) = (nX —na?")---(nX ~ nad’) € ZX], so we must have i € Z. Since 
this happens for all sufficiently large r, it follows that ¢ = +3 and so a’ is an 
algebraic integer. As a result, a is an algebraic integer and we're done. i 


Proof. This proof uses rather heavy material, but it is much more conceptual 
than the previous one. Namely, we will use a local-global principle, stating 
that an algebraic number is an algebraic integer if and only if v(x) > 0 for 
any valuation v on Q, ‘This follows easily from the relations between a number 
field and its completions (see the addendum on number fields), but the result 
is not obvious at all. Anyway, once we have this, the lemma is immediate: if 
v is a valuation, then we know that v(n) + ku(a) > 0 for all k. Dividing by 
k and making & > 00 yields v(a) > 0, which is enough to ensure that @ is an 
algebraic integer. im) 


The lemma is proven, and so we are done. OG 


Remark 9.21. The case when all a;’s are rational numbers is much easier and 
is a rather folklore problem. In this case, the problem reduces easily to the 
following: if p is a prime number and if a1, a2,...,@% are integers such that 
p” divides af + a3 +-+- + ag for all n, then p divides all a;’s. This follows 
easily from Euler’s theorem, by choosing n = y(p™) with N sufficiently large. 
Actually, using ideal theory as in the previous problem and imitating the proof 
for rational numbers, one can give yet another solution of the problem. We 
leave this ds a nice exercise for the reader. 
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Remark 9.22. Let us recall the proof of Newton’s relations. Let a; be elements 
of a field K of characteristic 0 and define 


Let Ph = af + ak +. ak, Observe that 


£(X) 


n 


identifying coefficients in the equality 


f'{X) =—-f(X)- yy Pyxko 


yields Newton’s relations 


Mbm + Pybmet t--+ + Pind = 0 


forl<m<n. 

The following is also a very tricky problem. We use a p-adic approach 
to solve it and we refer the reader to the appendices on p-adic numbers and 
number fields for more details. 


€ 19. Let p, ¢ be prime numbers and let r be a positive integer such that q|p—1, 


q does not divide r and p > r?—!, Let a1, a9,...,a@, be integers such that 
Bet aot ead 
a,* +49 +----+ ay? is a multiple of p. Prove that at least one of 


the a,’s is a multiple of p. 
J. Borosh, D.A. Hensley, J. Zinn, AMM 10748 
Proof. Let z= ef" and let K = Q(z). This is an extension of degree g~ 1 of 
@. By choosing a prime dividing p in the ring of algebraic integers of K and 
completing K with respect to this prime, we obtain an extension of the p-adic 
valuation vp on K’. Moreover, if z is an algebraic integer in K, then vp(x) > 0. 
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Assume that no a; is a multiple of p and let 2; = 27! (1 <i < q). Since 


g 
3 = tp(a?! —1) > 0, 
jul 
est 
there exists o(7) € {1,2,...,q} such that up(a;% — 24) > 0. Since we 


oa 


have vp, (S.0)* 


, X71, we have A f(z)) > 0. Since f(z;) is an algebraic integer for all 
i, tt Sitios’ that ep f(D Tae F(zi)) > 0. Let N = [] fy f(2:). N is an integer, 
because it is a symmetric polynomial expression with integer coefficients in 
the roots of the polynomial X9~! +.-.4+X +1. We claim that N is nonzero. 
Otherwise, there exists 2 <4 < q such that f(z;) = 0. The irreducibility of 
the polynomial 1+ X +.-.+ X¢~! over the rational numbers implies that f 
is a multiple of 1+ X +---+ X€1, But then r = f(1) is a multiple of g, a 
contradiction with the hypothesis. 
So N is a nonzero integer and vp(rN) > 0, so that vp(N) > 0 (clearly p 
does not divide r). Thus |N| > p. On the other hand, we have | f(z)! <r, so 
that |N| <r?! < p. This contradiction finishes the proof. o 


> 0, it follows that up (SJ zag) > 0. So, if F(X) = 


Proof. Here is a more elementary, but still very tricky solution, based on 
the theorem of symmetric polynomials. Suppose that none of the a,’s is a 


multiple of p and let h = g aa » where g is a primitive root mod p. We 
pat 
can therefore find positive integers m, such that a," = h™ (mod p), Let 


fHx™ 4X 4...4 X™ and let g € Z[X] satisfy 
F(X1) F( Xa) F(X qaa) = GON, Xa. X ger) oe(X 1X20 Xqut))s 


where o; are the symmetric fundamental sums. Let 21, 22...., Zq-1 be the 
complex roots of 


tenes (X =A) € Fp[X], 
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as h,h®,...,h7? are distinct gth roots of unity in Fy. This implies that 
Gi(21,22,-. +, 2yn1) Zai(h,h?,...,h7-1) (mod p) 


for all i. Therefore 


qu 


TI f(z) = olen R.A, qe (hh? AI7)) 


that is p divides the integer N = f(z1)---++ f(zg-1). As in the previous 
solution we obtain |N| <r?! < p and so N = 0. We conclude as in the 
previous solution. 0 


9.6 Miscellaneous problems 


It is really not easy to solve the following problem without the use of 
minimal polynomials. However, once the yoga of minimal polynomials is un- 
derstood, the argument is rather standard. 


20, Find the least positive integer n such that cos = cannot be written in 
the form p+ /q¢ + W/r with p,q,r €Q. 
O. Mushkarov, N. Nikolov, Bulgaria 


Proof. For n < 6, explicit computations show that cos 2 can be written in the 
desired form (the argument, is a bit tricky for n = 5, but note that z= e3 isa 
solution of the equation 24 ~ z? + 2° — z +1 = 0, which can also be written as 
(24271)? -(2+27!)-1=0.) The question is whether we can write cos = Fin 
the form p+/q+ ¥r with p,q,r € Q and the answer turns out to be negative, 
implying that the answer to the problem is n = 7. 

Let us assume that 


cost = pt Vat vr 
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and first compute the minimal polynomial of cos ¥. In order to do this, we 
will first ung a rational equation of low degree satisfied by cos 4. Let z =e 7, 
so that 27 = —1 and 


2 


B24 24 2342? 2 41=0. 


Dividing this by 23 and rearranging terms yields 
1 i 1 

B+ (#+3) +2+--1=0. 
4 2 a 


i 
Thus, if z = cos = ihe, then the previous relation gives 
8x3 — Ga — (4a? — 2) 422-1 =0, 


that is 82° — 42? -- 4 + 1=0. Since the polynomial 


ys iy, 1 1 
L(X)=X —3% 73X45 
is trivially irreducible over the rational numbers (it has degree 3, so we only 
have to look for rational roots), this is the minimal polynomial of x. Therefore 
«and 2 ~ p have degree 3 over Q. 
But then (observe that the identity ((\/@+ Wr) - /@)° =r easily implies 
that /9 € QL /7 + ¥F)) 


vat Vr): (Q(V9) : Q = [Q(ya+ 7): Q = 


and since [Q(/@) : Q] is 1 or 2, it follows that /f is a rational number. Thus 
e-~p-V@= Yr andu=p+ V@ is a rational number. Now, since z is 
irrational, we must have <r irrational and so X® — ¢ is irreducible over the 
rational numbers. Since f(u+ @/r) = f(x) = 0, it follows that X3 — r divides 
f(w+ X) and so (for degree reasons) we must have f(u +X) = X?—r. It 
is trivial now, by identifying coefficients, to see that this is not possible. The 
result follows. G 
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Proof. As in the previous solution it is enough to show that we cannot have 


cos 5 = p+ a+ vr. 


As before we compute that cos } satisfies 8c* — 4? — 4x + 1 and that this 
polynomial is irreducible since ie has no ay roots. Also either by noting 
that the other two roots are cos 32 7 and cos % or by plugging in a few values, 
we see that this polynomial has three real roots, 

Now let z = €?"/9 and suppose « = p+ /@+ 2* YF. Then 


~pE Ja = )8 + 3q(x — p) ¥ (8(@ — p)? + VG. 
So ; 
((2 =p) + 34(¢ —p) 7}? = B(x ~ pF +0) 
Thus 


G(X) = [(X — p)? + 3q(X ~ p) ~ rl? — (8X —p)? + @)?a € QLX] 


is a sixth degree polynomial with roots p +: /q@ + 2* ¥r. If the equality above 
holds, then ee polynomial must be a multiple of f(X), the minimal polyno- 
mial of cos} = p+ /G+ Yr. However f(X) has three real roots and g(X) 
has only two real roots (the ones with k = 0). Thus this cannot occur. aa) 


We continue with a very beautiful problem and a very elegaut solution. 


21. Let s1,82,... and t),f2,... be two infinite nonconstant sequences of ra- 
tional numbers such that (s; ~ s;)(4 ~ ¢;) is an integer for all i,j > 1. 
Prove that there exists a rational number r such that (s;—s;)r and fot 
are integers for all ¢, 7. 


USAMO 2009 


Proof. We start with some useful reductions: first of all, by working with the 

sequences (s;— 1); and (t; ~ ¢1),, we may assume that 3; = t) = 0. Secondly, 

there is u such that s, 4 0 and, by working with the sequences (s) and 
n 


(Su+tajn, We nay assume that 5, = 1. 
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Now, by assumption spt, is an integer for all mn. But then 
8ib5 + 8jty = sity + sjty — (8; — 85) {te “ t;) 


is also an integer for all 4,7. Since sity + jt; and (s;t;) - (s;t;) = (s:t;)(s;t;) 
are integers, 8;£; and sj; are algebraic integers. Since they are also rational 
numbers, they must be rational integers. Thus s;t; is an integer for all i, 7. 
For i = u, we obtain that all tj are integers. Let d be their greatest common 
divisor. Then clearly s is an integer for all 7. We claim that ds; is also 
an integer for all 7, which will solve the problem. But since d is a linear 
combination with integer coefficients of some t;’s (by Bézout’s lemma) and 
since sit; € Z for all i,7, it is clear that ds; € Z for all 4. The conclusion 
follows. 0 


In order to motivate the next problem, we will discuss first a very classical 
and nontrivial result in elementary number theory. The reader is advised to 
read the addendum 9.A before reading the proof. 


oon 9.23. (Lucas-Lehmer) Define @ sequence by ag = 4 and any, = 
a —2 forn > 0. Let m be an odd posttive integer and let n = 2" —1. Then 
n is a prime if and only if nlam—2. 


Proof. The first difficulty is to actually find a manageable formula for the 
general term of the sequen We use the identity 2? +27? = (a +x -1y3 -2 
and set ay, = tn +271 for a sequence tp > 1 (note that an > 2, so 2p exists). 
Then gp41 = eed 80 Dp = ze and we easily conclude that 


= = (24 V3)" + (2- V3)", 


Suppose that n = p is a prime and m > 3. Since p = 1 (mod 3) and 


(mod 8), the quadratic reciprocity law implies that (2) = 1 and 


(2) =-1. Pick some a in an algebraic closure* of F, such that a? = 3. 


i One “does not need the existence of an algebraic closure to prove the existence of a: if 3 
is a quadratic residue mod p, it is der what we have to do; otherwise, it is easy to check 
that F,[X]/(X? ~ 3) is a field with p? elements and we can take for a the i image of X in this 
field. 
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Note that @ is actually an element of F,2 and that we can define a map 
f: ZV3] > Bye by f(a +bV3) = a+ ba, where & = a (mod p) (seen as an 
element, of F,2). Since a? = 3, it is immediate to check that this is a ring 
homomorphism. Trivially, f vanishes on pe. Let x = f(24 V3) =2+a 
and y = f(2-— V3) =2~—a. Thus t,y € F,2 and they are nonzero, since 
cy = f(i) = 1. We want to prove that F(@m—2) = 0 or equivalently that 

i‘ = 0, ie. 2°F = <1, Since 2r = (1+ 0)?, we obtain the 
following equality in F,: 


+ 2 4 4 
ils () -0°F = (22)"8 = (14a)?! = (14a)(1 +0), 


Since a? = 3, we have a? = (3) ‘a= ~a, which combined with the previous 
equality yields the desired result. 

Let us prove the converse now. Suppose that nlam_2, we need that n is a 
prime. It is enough to check that for all pln we have p> /n. Since Pp divides 


m2, the previous arguments yield the equality (2+a) “FS fin F*,. Thus 
2+a6€ Bre has order n+ 1 and Lagrange’s theorem yields n + Up? —1. The 
result follows. QO 


22. The sequence ao, a), @3, ... is defined by ag = 2 and Okpy = 2az — i for 
k 2 0. Prove that if an odd prime p divides Gn, then 2"*5 divides p? —1, 


IMO Shortlist 2003 


Proof. Note that 2a, is precisely the sequence studied in theorem 9.23, so 


an = 


Let now p > 2 be a prime factor of @, and let a € Fy, be such that a? = 3. 
Define f,x,y as in the proof of the previous theorem. Since Plan, we have 
x” 4 y?” = 0, thus 22""" = —1. Hence x has order 2°** in the group Fy 
and so by Lagrange’s theorem 2”*? divides p — 1. Unfortunately, this is not 
enough, but we are close. 
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ifee Fi. everything is easy, since then Lagrange’s theorem for this 
subgroup yields 2"*?|p—1 and so trivially 2°*3{p?— 1, So, assume that z is not 
in F). Then «, y are roots of the irreducible polynomial X?— 4% +1 F,[X], 
so that we must have 2” = y. Indeed, since x? — da + 1 = 0, we also have 
(by raising the previous equality to the p-th power and by using the formula 
(a + y)? = x? + y?, valid in fields of characteristic p) @P — 42? +1 = 0, so 
that x? is also a root of X? ~ 4X +1, which cannot be x (because otherwise 
2? =a and x € F*), Thus 2” = y and so x?! = 1. But then 242 which is 


the order of x, must divide p+ 1 and we are done again. a 
There is really no obvious approach to the following rather exotic problem. 


23. Let k be a positive integer and let a1,a9,..., ay, and by, bg,...,b, be 


two sequences of rational numbers with the property: for any irrational 
mumbers 21,22,...,2% > 1 there exist positive integers nj, n9,... it 
and m),m2,..., 7, such that 


ay [xf] + ag[e}?] 4... 4 age) = by fe] + ba [ef] te by [ary*}. 
Prove that a; = b; for all i. 
Gabriel Dospineseu, Mathlinks Contest 
Proof. The key point is the following result: 


Lemma 9.24, For any integer N > 2 we can find irrational numbers a,b > 1 
such that for every positive integer m we have [a] = ~1 (mod N) and (eX = 
0 (mod N). 


Proof. We will choose a,b to be algebraic integers of degree 2. Let us show 
how to construct @ and leave to the reader the details for the construction of 
6. We want to find a polynomial with integer coefficients 


F(X) =X? 4ux +u=(X~a)(X~e) 


for some irrational numbers a > 1,0<e< 1. In this case, since a™ +c” ig an 
integer for all positive integers m, it follows that fa] = a + c™ — 1 for all 
m. Thus, we need to ensure that a” +c™ =0 (mod N) for all m. Since 


qQittt 4 emt — ~ufa” Ee ey) = ofan + rasa 
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for all m, it is enough to ensure that N divides u,v. Also, to ensure that 
0<c¢ <1 we will choose v > 0 and i+u+z <0. For instance, we can take 
us —2N,u= N, yieldnga=N+/N?—N., 

Similarly for 6, we will choose u == —(2N + 1) and v = N, 80 


QN+1+V4N7 41 
Be in im} 
2 
Coming back to the proof, choose a positive integer N and a, irrational 
numbers as in the lemma. Set 2; = a and 22 = -:- = 2, = 6. By hypothesis, 
we can find positive integers ni, n2,...,m, and m,,m2,...,7™m, such that 


ayo] + ap[zt?] +--+ age t®] = bya] + bole] +--+ bel. 


By the properties of a and 6 we deduce that a, = by (mod N). Since N was 
arbitrary, it follows that a, = 6;. Since we can do the same with the other 
pairs (a;,5;), the result follows. im) 


The following result is really a mathematical gem, taken from [5]. It is 
quite difficult. and has a very elementary proof. 


24. Prove that if py, p,..., py are distinct primes and if 


Q4/Pi + da/pr +-+++ dny/in = 0 


for some rational numbers a1, a@2,...,@n, then a; = 0 for all i. 


Besicovitch’s theorem 


Proof. We will prove by induction on n the following statement: for any m > 1 
and any distinct primes 91,2, ...,@msPiy Pas --. Bn We have! 


VG" Gn ~ QU Pts VP2; +--+) Pr) 


extension of fields and if 2,,22,...,%m € K, we let F(x, ©2,...,%n} be the 

smallest subfield of K which contains F and 21,22,...,%n. It is also the set of elements of 
the form f(a1,€2,...,¢n), where f is a rational function in nm variables with coefficients in 
F. 
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Let us prove the base case: assume that n = 1 and that 
VER Gn = a+ d/pr 


for some rational numbers a,b. Squaring this relation and using that Vin is 
irrational, we deduce that ab = 0. But then either q1q@2-++@m OF G1q2°*-GmPi 
is a perfect square, which is clearly not possible. Now, assume that the result 
holds for n and let us prove it for n +1. Let F = Q(/1, Ba, ---, Pn) and 
assume that \/9192°-"Gm = @ + b\/fnsi for some a,b € F. Again, we square 
this relation to deduce that 


However, by the inductive hypothesis we have \/pnyi ¢ F, so we must have 
ab = 0. If a = 0, we obtain that VPrH1GiG "Ga € F, contradicting the 
inductive hypothesis. If b = 0, we get a similar contradiction. In all cases, the 
inductive step is proved and the conclusion follows. a 


Remark 9.25. In [58], Mordell proved the following generalization: 


Theorem 9.26. Let K © L be fields of characteristic 0 and let x1,22,...,¢» be 
elements of L such that for alli there exists a least positive integer nz such that 
a;* € K. Suppose that for all integers e1,€2,...,€p, if x? Egress a eK, 
then n; divides e; for all i. Finally, suppose that L ¢ R or that K contains 
all n;th roots of unity, for alli. Then (22 : 2B tttee e ocizen; is a linearly 
independent set. In particular, |K(a1,@2,...,¢,): K] = nie++-: Ry 
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Addendum 9.A Equations over 
Finite Fields 


This addendum is a modest introduction to finite fields and polynomial 
equations over finite fields. There are some very beautiful and extremely deep 
results on the subject, which are far beyond the scope of this book. But the 
fact that their proofs are very difficult should not be a reason for not presenting 
them. We highly recommend the introductory text [43] for more details. 

To avoid spending teo much time on preliminaries, we will fix a prime 
mamber p and an algebraic closure F, of the field F, = Z/pZ. Recall that 
this means that any x € F; is a root of some nonzero polynomial f € F,[X] 
and that any f ¢ F,[X] hee at least_one root in F, (which actually implies 
that it splits into linear factors over F,). It is a rather nontrivial theorem of 
Steinitz that any field has an algebraic closure and any two algebraic closures 
are isomorphic. We take this approach when introducing finite fields since it 
is pretty rapid, though not very elegant... 

Before proving the first fundamental result, let us glorify the following 
easy result, which will be constantly used in this chapter: 


Proposition 9.A.1. Let p be a prime and let A be a ring such that" pa = 0 
for alia ¢ A. Then for all powers q of p and for all a,,a2,...,4, € A we have 


(ay +g +++-+4n)o al tages. tal. 


Proof. By induction on n, we may assume that n = 2. Then everything follows 
from the usual binomial formula, the hypothesis on A and the fact that Q= 0 
(mod p) for any 1 <i < q. G 


If ¢ is a power of p, let 
F, = {x € F, |x? = x}. 


We have the following easy, but crucial result: 


Theorem 9.A.2. Fy is the unique field with q¢ elements contained in Fy. 


®We say that A has characteristic p. 


: 
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Proof. First, let us check that F, is a field. It is clearly stable by multiplication 
and stability under addition follows from the previous proposition. Fy has q 
elements since X? ~ X splits into linear factors over F, (because Fi is alge- 
braically closed) and all of these linear factors are distinct (because X7 — X 
is prime to its derivative ~1). 

Let us consider now a subfield L of F, with g elements. As L* is a group 
with ¢ —1 elements, Lagrange’s theorem yields 27”! = 1 for all z € L*. Thus 
a? = x for alla € L and so L C Fy. A cardinality argument finishes the 
proof, q 


A more subtle result is the following generalization of Gauss’ classical 
theorem on primitive roots modulo prime numbers. 


Theorem 9.4.3. F} is a cyclic group of order q-1. More generally, if K is 
any field and G is a finite subgroup of K*, then G is cyclic. 


Proof. Let d be the maximal order of the elements of G. It is a general property 
of finite abelian groups that if c,y € G have orders m,n, then one can find 
2 € G with order Iem(m,n) (the reader can take this as an easy exercise). 
Using this, we deduce that the order of any element of G divides d. Thus for 
all g € G we have g4 = 1. But the polynomial X¢ ~ 1 € K[X] vanishes at al 
elements of G, so d > [G|. On the other hand, d is the order of some element 
of G, so d{|G| by Lagrange’s theorem. Therefore d = |G| and G is cyclic. 


There is a trap concerning finite fields: it is not true that if n > m, then 
Fym C Fyn. Actually, this inclusion takes place if and only if X?"—!—1 divides 
X?"~1 _ 1 (this follows immediately from the definition and the fact that the 
roots of X¢~ X are simple) and this happens if and only if p™ — 1 divides 
p" ~ 1, which in turns happens if and only if m divides n. 

A fundamental object in the theory of finite fields is the Frobenius map 


Fro i Bye > Fy, Frg(x) = 2%, 


an automorphism of Fy» which acts as identity on F,. Moreover, any such au- 
tomorphism is an iterate of the Frobenius map and there are precisely n such 
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automorphisms.® All these results would be pretty hard to prove without the- 
orem 9.A.3, but they become easy exercises once we have it. The following 
result is fundamental. It says that if you know a root of an irreducible poly- 
nomial over Fy, then the other roots are obtained by successively applying the 
Frobenius map to that root. 


Theorem 9.A.4. Let f € Fy[X] be a monic irreducible polynomial of degree 
n and let « © F, be a root of f. Then the roots of f are z,x%,x7,...,27 
In other words, f(X) = TING (X ~ 2). 


a: 


Proof. The key point is that 29" = x. Indeed, the field generated by « over 
F, (inside F,) has g” elements, because x has degree n over F,, so this field is 
Fyn. But in Fyn all elements are roots of X97" ~ X. Having done this, define 
the polynomial G(X) = J]q'(X — 2%). The key point and proposition 9.4.1 
yield 


n-1 end 
G(X)? = T[ (xt = 2?) = T] (xt - 2%) = a(x. 
i= t=O 


Thus, if we write G(X) = go + 9X +--+ + 9X", then again by proposition 
9.AT 
G+ GXT + + GX = gy + XI + WX, 


which implies that gf = g; for all i and so g € F,. Thus G é€ Fy[X]. Since 
G vanishes at x and f is irreducible, we deduce that f divides G. A degree 
argument, finishes the proof. ia 
9.A.1 Norm and trace maps 


Consider a finite field F, and a finite extension Fyn. Define the norm and 
trace maps by 


n=l 

7 3 

Nan jt, Bye Fy, oe ] | at, Tre yn /Fq iF Fp, ore } a? 
=0 


°In fancy terms, the Galois group of the extension Fyn /F, is cyclic of order n and gener- 
ated by Fr. 
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The following result summarizes the basic properties of these maps, that will 
be used in future sections. 


Proposition 9.A.5. The norm and trace maps are surjective maps from Fan 
to Fy. The norm map is multiplicative and the trace map is additive. 


Proof. To avoid complicated notations, write N and T for the norm, respec- 
tively trace map. First, let us check that N(x), 7'(x) € Fy for all 2 € Fyn. 
It is enough to see that N(x)? = N(x) and T(x)? = T(x), For N(x), this is 
clear since x?” = x, while for T'(a), this follows from proposition 9.4.1 and 
the equality 2?" = 2. It is clear that N is multiplicative and proposition 9-A.1 
shows that T is additive. It remains to prove the surjectivity of these maps. 
Let € be a generator of FF and let u be a generator of Fjn. There exists 
a € Z such that € = u®, As €9! = 1, we have u%4-) = 1 and so there is 
an integer b such that a = b- a But then € = N(u®) and the surjectivity 
of N follows. For the trace map, this argument does not work, however we 
note that T(ax) = aT (x) for any a € Fy, and any & € Fyn. Thus, it is enough 


vanish on all of Fy. Q 


9.A.2 Characters of finite fields 


As F,= is an n-dimensional vector space over F,, the choice of a basis 
yields a group isomorphism Fp. ~ Fp x --. x F,. Now, basic properties of the 
dual of a group discussed in section 7.A.1 yield the following result. 


Proposition 9.4.6. Let q be a power of p. There is an isomorphism of groups 
a ta between Fy and its character group, where 


Also, Fj is a cyclic group, so its group of characters is also cyclic of order 
q~1. The following result will play an important role in the following sections, 
when we will compute the zeta function of a diagonal hypersurface. 
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Proposition 9.4.7, Let d be a divisor of q—1. The map X > Xn, where 
Xn(z) = X(N Bn jm, (2)) induces a bijection between characters of order d of Fy 


and characters of order d of Btn. 


Proof. The fact that yn is a character of order dividing d is a consequence of 
the multiplicativity of the norm map. The fact that it has order exactly d and 
that x + Xn is injective is a consequence of the surjectivity of the norm map 
(proposition 9.4.5). It remains to check the surjectivity of x -> Xn. Let ¥ be 


a character of order d of F gr and let u be a generator of F*,. Then € 
is a generator of F¥ and since ¥(u)1"! = 1 (because X? = 1 and d|g—1), there 
is a unique character y of F q Such that x(€) = X(x). By construction, ¥ = xn 
and the result follows. 


Just as for Dirichlet characters, it is convenient to extend the definition of 
a multiplicative character y of F) to Fy, by defining (0) = 0 if y is nontrivia’ 
and (0) = 1 if y is trivial. The following innocent-looking identity will play 
a crucial role in future arguments and is constantly used when dealing with 
equations over finite fields: 


Proposition 9.4.8. Let d be a divisor of q—1 and let x € Fy, The num- 
ber of solutions of the equation yt = x with y € F,, denoted N(y* = x) 
is Daytet x(z), the sum being taken over all multiplicative characters whose 
order divides d. 


Proof. If x = 0, this is clear, as both sides are equal to 1. Assume that x #0. 
If the equation y* = x has a solution in F,, then it has exactly d such solutions, 
as the equation y = 1 has precisely d solutions in Fj (because d|q -- 1 and Fy 
is cyclic of order q~ 1). On the other hand, the dual group of F7 is also cyclic 
of order g — 1, so the equation y4 = 1 has d solutions and for each of them 
x(z) = x(y2) = x(y)¢ = 1, so both sides of the equality we want to prove 
are equal to d and we are done. Finally, if the equation has no solution, the 
result is a consequence of the orthogonality relations (theorem 7.A.5) for the 
abelian group Fy/ {e%|z € Fj}, whose dual group is precisely the subgroup of 
those multiplicative characters y such that x4 = 1 (actually, this argument 
also covers the previous case... ). oO 
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Finally, the following result, will be used in the proof of the Davenport- 
Hasse relation, to which the next section is devoted. It is by no means specific 
to finite fields, but the short proof we are going to give uses properties of finite 
fields developed in the previous sections. 


Proposition 9.A.9. Let x © Fy» and let 
f= Xm ay Xb 4... + (~1)!ag € Fy[X] 

be its minimal polynomial over Fy. Then din and Tijro (X a?) = fa. In 
particular, Ne» x,(z) = ad and Trp yn /B,(@) = Gar. 

Proof. Since [Fj(x) : Fy] = deg(') = d, we have F(a) = Fe (this uses theorem 
9.A.2). But then Fja C ify and, as we have already remarked, this implies 
that din. Next, for degree reasons it is enough to prove that g = ny (X-2”) 
has only one irreducible monic factor, namely f. But if h is such a factor, then 
h has some root x? . But proposition 9.4.4 implies that f also vanishes at xc? 


(note that: the cited proposition applies only for j < d, but we have 2 = x 
anyway, since we have seen that F,(x) = Fj«). Thus ged(f,h) is nonconstant. 


9.4.3 Gauss and Jacobi sums, the Davenport-Hasse relation 


Gauss and Jacobi sums play a fundamental role in the theory of equations 
over finite fields and in number theory, in general. We give here their basic 
properties, that we will need in the following sections. But before doing that, 
it is convenient to define them... 


Definition 9.4.10. 1) If % and y are characters of F,, respectively FE 
the associated Gauss sum is 


Ix.0) = S> x(x)v(e). 


zeke 
2) If x1 and x2 are characters of Fj, the associated Jacobi sum is 


JOasx2) = SS xi(w)xaly)- 


a yeFy ctyal 
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Theorem 9.4.11. If x and are nontrivial, then \g(x,)| = /@. 


Proof. The orthogonality relations (theorem 7.4.5) yield (using also the sub- 
stitution { = t) 


loxP? = S> x@/yvle-y) = S> x@v(ylt-1)) 


ayers tyeF, 
=> xt) (x wy - ))- :) = 3 xO@- ler - 
EF} ¥eFy teRy 
=q-1~ So xQ=9- Vixo=¢ 
t40,1 teF5 


Corollary 9.A.12. If x and w are nontrivial, then 
9069) 9071. #) = x(—Da. 


Proof. This is just a long string of obvious computations, using the previous 
theorem and the fact that g(y, #(~:)) = x{~D g(x, ¥) (which is immediate by 
definition and the fact that r + —z is a permutation of Fj). More precisely, 
we have 


9071.0) = a(%¥) = 9009) = 90, 0(—)) 


ia 


One has the following beautiful result. which connects Gauss and Jacobi 
sums. Note the striking similarity with Euler’s famous formula 


P@)E@) 


FG Tay)’ 


gasrsseistit 
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where ~~, 7 H 
T(z) = f eld, Bla, y) - | el — 4) de, 
o a 
the integrals being convergent for Re(z), Re(y) > 0. 


Theorem 9.4.13. If x1, x2 are nontrivial characters of Fy such that x1 x2 
is nontrivial, then for all nontrivial characters of Fy we have 


ee %) 


Proof. This is a rather tricky computation: 


SSS xxl - 2)xaoy). 


eeF,~{0,1} yer; 


JO, x2)(xix2. 0) = 


Using the substitution a = zy and b = y(1 — 2), this becomes 


SS xila)x2(o(a +b) = a0. Wo(xa 8) ~ SZ xa(e)xo(—2). 


abe sath A0 acs 


As 1X2 is nontrivial, the orthogonality relations (theorem 7.A. 5} yield the 
desired result. O 


Here is a striking application. Assume that p = 1 (mod 4) is a prime. As 
F} is cyclic of order p ~ 1, there exists a nontrivial character y; of order 4 of 
Fp. Let x2(z) = 
that |JOa,x2)|? = p. On the other hand, it is clear that X1 takes only the 
values 0, +1, 4:4, thus J(x1, x2) € Zi]. In particular, |J(x1, x2)! is the sum of 
the squares of two integers. We recovered thus Fermat’s celebrated theorem 
that any prime of the form 4k + 1 is the sum of the squares of two integers. 

We end this section with a much deeper result, the famous Davenport- 
Hasse relation. This is a quite strong identity between Gauss sums, which is 
crucial for the proof of Weil’s theorem 9.A.24 that we will see a bit later on. 
It also has relations to the Langlands program, but that is really beyond the 
scope of this book. The proof is very ingenious. 


A) be Legendre’s symbol. The previous two theorems imply 
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Theorem 9,A.14. (Davenport-Hasse} Let x and & be nontrivial characters 
of Fy, respectively Fz. Then —gn(x,¥) = (~9(x,¥))", where 


on(x,0) = >> Nigar (2) WT (2) 


EF yn 


Proof. As Ng... /e,(x) and Trg,, /¢,() only depend on the minimal polynomial 
of « and not on its roots, we will partition Fyn into collections of conjugate 
elements over F,. Define 


MX — byX44 4... + (-1)%ba) = x (ba) o(br) 


for any 6; € Fy. It is easy to check that A(fg) = A(f) - A(g) for all monic 
polynomials f,g ¢ F,[X]. Combining this with proposition 9.A.9 shows that 
if x € Fyn has minimal polynomial P = X¢ ~a,X¢-1 +--+ (—1)%aq over Fy, 
then 


X (Mego te(2)) ¥ (Tre ye re(@)) = x(aa)"/4v (Sar) = (Py. 


Summing over all conjugates of x and then over collections of conjugate ele- 
ments in Fyn yields the crucial identity 


gx) = SO aypy4, 
d=mdeg Pin 


the sum being taken over all irreducible monic polynomials P € F,[X] whose 
degree divides n. 
To exploit this relation, consider the Z-function 


" i 1e} 
Loy <I] rahe = EN 


Here the product is taken over all P € F,[X] monic irreducible, while the 
sum is over all f € F,{X] monic. The second equality follows from multiplica- 
tivity of A and the unique factorization theorem in F,[X]. Taking log and 
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differentiating, we obtain 


Te 
7 ay oP 2 logtits 


L(T) at 
= SOS deg Pya(pyrre seat?) 
Fon>) 
-5( So apyr/4) 7, 
n  \d=deg(P)|In 


Combining this with the key relation of the previous paragraph, we conchide 


that 
ee aR =Lant yr", 


Finally, we will show that L(T') has a very simple expression. Note that 
LT) =14+ 01 SO xy |r. 
n>t \deg fun 


On the other hand, 


s Ai = “a) = Lix(awi ywla) = g(x), 


deg f=i 


while for n > 2 


DAN = SS x(n)(ar) = 9g"? Svar) - ST x(an) = 0, 


deg fan 3 yigO 
by the orthogonality relations (theorem 7.4.5). We deduce that 
LP) =1+90v)T 


and the result follows immediately from this and the last equality of the pre- 
vious paragraph. ia 
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We end this section by stating a very deep result of Dwork, a consequence 
of his proof of the rationality of the zeta function of algebraic varieties. It is 
a vast and very difficult, generalization of the Davenport-Haase relation. 


Theorem 9.A.15. Let f,g € F,[X! and let x, be multiplicative, respectively 
additive characters of Fy. If 


Sn = S> x (Neg m(f@))) 8 (Tey /e,(92))) ; 


2eFgn 


then there exist polynomials P,Q € C[T] such that P(Q) = Q(0} = 1 and 


9.4.4 Diagonal equations and a theorem of Weil 


Using almost everything we have done so far, we can prove the following 
beautiful theorem of Weil. The true beauty of the result will be revealed in 
a next section, when we will use this result and the Davenport-Hasse relation 
to compute the zeta function of a diagonal hypersurface. Before stating the 
theorem, we need some notation. Let ag,az,...,a¢ € Fy and let vo, x1,+++Xt 
be multiplicative characters of Fj. Consider the additive character o(x) = 


ee Fret) ond let 90a) = 90x, b). Finally, define 


" — 90) 90a) aaa) 
Wales Xts---oX0) xo(@o) xi (ar) xia)” 


Note that this quantity depends on the a;’s, but we suppress the dependence 
from the notation, as we will consider the a;’s as fixed elements, while the 
characters x; will vary. We are now ready to state and prove: 


Theorem 9.4.16. (Weil) Let ao,a1,...,a; € Fy and let X be the projective 
variety defined by agaf! + aizj" + +--+ aaj" = 0. Then 


i 
rae 
|X (Fy)| = So? + j 5 We (x0. Xty-+ + Xt): 
320 


XOX Lert 
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the sum babs taken over all nontrivial characters x; of Fy such that x? = 1 
and Xoxis++Xe= 1. 


Proof. Let M be the number of solutions (x9, 21,-.., 27) € Fi of the equation 


agzg ++ ms ony = 0. Since we work with a projec tive variety, we have 
iX¢ FI = = Moh ; 80 it remains to find M. Note that 
M= = lagagt+-taae'=0 
B05. C1 CF gy 
= SS lawson $7 depen 1 Lepony 
UO UEC y LORE 
= ys N(eq = uo). <--> NP = wu), 
au) 


where we wrote for simplicity a-u = 0 for agug +a, +--+ + ayu; = 0. Using 
proposition 9.A.8, then expanding the product and re-arranging terms, we 
obtain 


i 
Me ST ( Moued}= + ( > rleaala)~-ae)) 


ausOj=0 \yp=1 XPHlvosjet \au=0 


Next, note that 


4 
D2 xo(%0) xa (un) ++ xe) = T] x9 (a3) Solos ---.x0)s 


aeu=0 j=d 
where 
Jo(xos X41 +66 Xt) = 2 xo(uo) xr (ta) xr(u). 
toby =O 


So, we end up with the pretty complicated formula 


i 
M= Dy Tiles) Joo, 32). 


Mee 


Xp Smet jad 
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It is convenient to study in more detail these sums Joxor Xi. xD 
which are generalizations of the Jacobi sums discussed in section 9.A.3. The 
following lemma deals with those terms in the surn defining M7 for which some 
character is trivial. 


Lemma 9.A.17. Suppose that the trivial character appears in the list 
Xo:Xis---,Xp. Then either xj = 1 for all j, in which case 
Jo(xo. xa, xd =¢, or Joyo, xa,---.x2) = 0. 


Proof. If all xj = 1, it is clear that Jo(xo,x1,--- -x1) = ¢ (don’t forget that 
the trivial character evaluated at 0 yields 1 by convention). Assume that not 
all x; are trivial, say (without loss of generality) yo = Xi == Xe = Land 
x; #i for j >k. Then 


To(Xo; X14--+5 x2) 
= SS wenn) x(a) 


Ug eUpb tay) 


=O SY) xeea tesa) xi(u) 


URE Ue 
i 
=@ TT | Soxsby) 
joker uy 


the last equality being a consequence of the orthogonality relations (theorem 
7.A.5). Cl 


Using this, we obtain 
i 
Mad+ S$ [Exile olor... 
XP = LxaAl j=20 


The crucial step is the following lemma, whose proof uses a generalization of 
theorem 9,A.13. 
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Lemma 9.A.18. /f xo, x1,X2,--.,xi are nontrivial characters and 


then Jo(Xo.X1,-- +) X0) =O. If xox xi = 1, then 
-1 
Jo(xos Xt5--+.Xt) = *—alxo)a(x1) + 90a). 


Proof. Let Jo = Jo(xo,X15---.x1) and 
Sy = Alxasxas-:xa) = SS xa(er)xeo(u2)---xr(en). 


ty pugt--+u =] 


Then. 
Jo= 3 y xo(t)xa(e1) +++ xi(z7) 
teFs Rabe papeet 
= SS wba DA 
EFS 


= Gace xe)(-1) [SO xoxr xe) | 
teFy 
Tf xoxa---x7 # 1, we are done since Deets xXox1--:x7(£) = 0 in this case 
(orthogonality relations). If yoy1-+- x = i, the previous equality becomes 
Jo = xo(-D(q — 1). Since 9(xo)a(xe*) = xe(—Da (by corollary 9.4.12), it 
remains to prove that 


= Barge) 90a) 


gax2 + xi) 
Now, hy definition we have 


axa) 9(x2) 900) = SP xi (er )xalwe)-- xe O(ar + 224+ + 2p) 


= Jo(xas xa, x0) + [ SO waar alt) | A 
io 


= Jo(x1, x2a,---. x0) + 9OaNa xDSL. 
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As xix2°--xr # 1, we have Jo(xi.x2.---.X2) = 0 by the first part of the 
proposition and the conclusion follows. C 


Finally, using the previous lemma, we obtain 


bs I axg)x9(@j) + 
XO+ 


saXt J=O 


M = q+ 


the sum being taken over all nontrivial characters y; such that xf = 1 and 
xo'''Xi = 1. The result follows, 


9.A.5 The zeta function of an algebraic variety 


In essence, an affine variety over a field k is the locus in k™ of a bunch 
of polynomial equations in N variables with coefficients in k. A projective 
variety is the locus in the projective space P"(k) of a bunch of homogeneous 
polynomial equations in n + 1 variables and coefficients in k. If X is an 
algebraic variety over Fy, it is natural to consider the number of points of X 
over the various finite extensions Fy». The zeta function of the variety X is 
(up to a convenient normalization) the generating function of the sequence 
obtained in this way, i.e. 


Zx(T) = exp | SD Reed pa 


n>i 


Clearly, Zx(T) is a formal series in T with rational coefficients. 


Example 9.4.19. Consider finitely many polynomials fi. fo,..-, fs in k vazi- 
ables with coefficients in F,, and let a, be the number of solutions in Bx of 
the system of equations 


Sileis-.+) tk) = foley... 2%) = ++ = fo(ei,-..,t%) = 0. 


The zeta function of the affine variety defined by the polynomials fy,..., fs is 
exp (Sao aun). On the other hand, if f,,..., fs are homogeneous polyno- 
mials , the zeta function of the projective variety defined by these polynomials 
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is exp (Snes oayT™), as this time two solutions that differ by a nonzero 
element of Fyn are the same clement of the projective space. 


Remark 9.4.20. Suppose that 
|X(Bgn)] = of + 2} bee te mul uP 


for all n > 1 and some complex numbers z,u; (as we will see, this always 
happens, but this is very difficult to prove). Then 
(L~ Puy)(L — Tua) -: + ea Tu) 


2x7) (1—T2)G— Tx) -—Tz,)’ 


essentially by definition of Zx(T) and by the equality of formal series 


Ss —T = —log(1~ aT). 


n>1 
For instance, if X = P", the projective space, then 


[X(Eqo)}= gh + ql ee ghd, 


: 
i 

T)=|[—= 

x(P) p41-¢r 


Another trivial example is the variety defined by the equation ryz = 1 in 
three-dimensional affine space. Then clearly |X (Fy:)| = (¢* — 1)?, so that 


~ any’ 
a(S eT 
x(T) = Td = a 
Let Z[[T']] be the set of formal series in T’ with integer coefficients. Note 
that in all previous examples we have Zx(T) € Z[[T]. It turns out that 
this is always the case. The following result discusses more generally when 
exp (3°, @T") € ZT]. 
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Proposition 9.A.21. Let a, be a sequence of integers. There exist unique 
sequences of rational numbers by, and c, such that 


exp {So Sr" | =14+ oT" = TJa-7). 


n>1 ue n>1 n>l 


Moreover, all by, are integers if and only if all cn are integers, if and only ifn 
divides Dra, tt (3) aq for all n (where jt is Mébius’ function). 


Proof. The existence and uniqueness of 6, is clear. As for c,, the key point is 
to consider 


log [[ 2 ~ 7") = SO cn log(t ~ I") 


n>1 n>t 


Thus we need to find the sequence ¢, for which Lajn deq = —G@n, for all n. But 
the Mébius inversion formula yields the explicit form 


showing the existence and uniqueness of c,. 

It is clear that if ¢, € Z, then 6, € Z for all n. The converse is proved by 
induction. Since c, = —b;, we have c; € Z. Suppose that cy,..., cn.) € Zand 
observe that f = [],.,(1— 7%)® is invertible in Z{[7], as its constant term is 
1. Thus 


1+), 6jT? 
- f 


(L= Tyee) = TAH Yet ss € A(T}. 
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Considering the coefficient of " in the left-hand side of the previous equality, 
we obtain that c, € Z. Using also the explicit formula of the c,’s in terms of 
the a,,’s yields the last statement of the proposition and finishes the proof. 


Let X be an algebraic variety over F,, say defined by some polynomials 
Ji, fo,....fa in n variables with coefficients in Fy. If 


B= (21,...,€n) € X(Fyn), 


define 
Fr(x) = (24,0$,...,2%). 


It is easy to see that this is again an element of X (Fyn). Let f be the 
smallest positive integer such that z ¢ X(F,;) and associate to a the cy- 
cle (a, Fr(z),...,Fr/~1(e)), of length f. Then X (Fyn) is the disjoint union 
of the cycles of length dividing m, so if @, is the number of cycles of length 
n, then [X(Fym)} = Daim daa. Combining this and the previous proposition 
yields 
2x(T) = T[@- 17 )* € Zr]. 
nal 

Remark 9.4.22. The previous proposition is powerful in other contexts, too. 
For instance, it immediately yields the equality of formal series 


exp(X) = [[—x")-Se, 


n>l 


where j is Mébius’ function. The proposition also easily implies the following 
equality 


exp oo. = J] x7) 


n2o n>1 
ged(n,p)=1 
showing that the Artin-Hasse exponential exp (Sao0 x) has coefficients in 
Z,. This is absolutely not clear from the definition and plays a major role in p- 
adic analysis and also in Dwork’s proof of the rationality of the zeta functions 
attached to algebraic varieties over finite fields. 
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In general, it is a very deep problem to compute the zeta function of a 
given variety. Yet even without computing the zeta function, one can say a 
great deal of things about it! This was conjectured by Weil in the wonderful 
paper [84] and proved after a gigantic work by Deligne and Grothendieck. The 
following theorem is réally one of the most difficult and beautiful results of 
modern mathematics: 


Theorem 9.A.23. (Deligne-Grothendieck) Let X be a non-singular projective 
variety of dimension n over F,. Then the zeta function of X is a rational 
function, More precisely, there are polynomials Py, P;,...,Pan € ZIT] such 


that 
P,(P)P3(P) ++ Pon-(P) 
PL) P2(T) +++ Pan(T) 


2x(T) = 


and 


a) P(T) =1~T and Py, (P) =1—@°T. 


b} We can write P; = aa —wyT), where wy are algebraic integers such 
that {wij| = q'/? for all i,j. 

o fx = 79 (- 1)'b;, then the zeta function satisfies the functional equa- 
tion 


1 n 7 
Zx (=H) = ch(q?T)*Zx(T) 


for some sign +. 
If X is a smooth projective curve of genus g (this is an important invariant 
attached to curves; in the notations of the theorem, we have b = 2g) over Fy, 
yi 
Weil proved in 1940 that its zeta function can be written in the form ata 


for sorne polynomial P € 1+ ¢Z[t]. Moreover, the roots of P have absolute 
value 1/\/q. So in this case bp = bg = 1, b, = 2g and moreover we can write 


|X(Fyr)| = 14g" — WP tw ++: +§,) 


with {w;] = \/@. In particular, we obtain the very nontrivial estimate 


|X (Bye) = P+ g%)| < 2gq?? 
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This last estimate was obtained by Lang and Weil in 1954 for arbitrary va- 
rieties, before the proof of the previous deep theorem. For an even more 
concrete example, consider integers a,b and a prime p > 3. The condition 
that the curve y? = f(x) be non-singular is that p does not divide 4a? + 2742, 
In this case the curve y* = f(a) has genus 1 and one point at infinity. Hence 


X(Bp)| = 1+ [{(2,y) € Fp x ply? = 2° + ae +0} 
and the bound above becomes 
|I{(o,u) € Fp x Fply? = 2° + ax +} — pl < 2Vp. 


This reproves a famous theorem of Hasse (there are however easier proofs, but 
they require a good knowledge of the theory of elliptic curves and quite a lot 
of algebraic geometry). 

Dwork’s p-adic proof (1960) of the rationality of zeta functions works 
for affine or projective varieties, be they non-singular or not. For a non- 
singular projective hypersurface defined by a polynomial f € FyLXi,..., Xl, 


homogeneous of degree d, Dwork proved that its zeta function is of the form 

(-1ynFt : : A) = penis 
eed for some P € 1+-¢Z[t] of degree feria) We will prove 
this in the next section, in the much easier case of diagonal hypersurfaces (a 
famous theorem of Weil). 


9.A.6 Zeta function of diagonal hypersurfaces 


In this part, we show how to compute the zeta function of a diagonal 
hypersurface, This beautiful result, due to Weil was also the starting point of 
the famous Weil conjectures. 


Theorem 9,A.24. (Weil) Let | > 1, mlq—1 and let X be the projective 
hypersurface of equation agxh' + ayxt' +++ + ajzf° = 0. There exists P € Z[T) 


of degree d= i) a such that 
my PT 
@) 2x(2) = goat 


b) If P(z) = 0, then 1/z is an algebraic integer of absolute value = 
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c) There exists an explicit integer y such that 


2x (geap) = # (02) zat 


Proof. Recall that by theorem 9.4.16 we have for any q an equality 


t-1 
N= ae t= Waboxt xh 


FO XOX Eye KE 


the sum being taken over all nontrivial characters y; of Fy such that yi" = 
land yoxr---x; = 1. The main point is to study how the numbers 
Won(xo,-+-,X2) vary and this is accomplished by the Davenport-Hasse re- 
lation. More precisely, recall that for a character x of F] we have a character 
Xn(z) = XCM En /,(®)) and that x ~+ yn induces a bijection between charac- 
ters of Fj of a given order and characters of jn of the same order (proposition 
9.4.7). So, if S is the set of 1+ l-tuples (xo,..., x) of nontrivial characters 
of F) such that y?" = 1 and yoyy-- “xy = 1, then 


1-1 
; 1 — 
[X(Byn)] = » g™ + a ys Won (X0,n Xin +0 Xin). 
J=0 Ooxirsaes 


On the other hand, by the Davenport-Hasse relation (theorem 9.4.14) we can 
write 


Wor (0,0 Xt.ns += Xin) = (Dt (-1O OW, (x0, x2)”. 
We deduce that 


ia 4 7 
Xen) = Sa = (-1! (PP wice..-.20) 
j=0 ¢ 


(Xora XIES 


and so by the previous paragraph we fin ally obtain the first part of the theorem, 
with : 


Welxa,-.- xT) . 
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Note that Wo(xo.-..,X:) # @ for all (yo,..-,x7) € S, as the Gauss sum 
associated to a nontrivial character is nonzero. Thus deg(P) = |S]. It remains 
to find this number. See || as a function f(J) of 1. Let g(l) be the number 
of 1 + 1-tuples (xyo,.--, xz) of nontrivial characters such that x” = 1 and 
xo Xt # 1. Clearly, g() = fd +1). But f() + g(2) is just the number of 
tuples of nontrivial characters such that x7" = 1. As there are m— 1 nontrivial 
characters of order dividing m, we deduce that f(!) + g(2) = (m—1)41. One 
immediately deduces that 


2 (m ase pe + (~1)m ~ 1) 
A eae ann 


finishing therefore the computation of deg(P). 
By definition of Wy(xo,-.-,x2) and by the fact that [g(xi)| = 4/@, we 


deduce that [Wo(xo,---,x0)] = qt. This yields part 2) of the theorem. 
Next, the fact that goagoq") = xi(—Lg and xo--- x; = 1 implies that 


Wit ax) = 


As clearly the set 5 is stable by inversion, we deduce that the map z > oir is 
a permutation of the roots of P, From here, it is an easy but tedious exercise 
to deduce the third part of the theorem. 

Finally, it remains to prove that P € Z[T). As Zx(T) € QIIT'], we 
must have P € Q(T]. We will prove that the coefficients of P are algebraic 
integers, which will be enough to conclude. It is enough (taking into account 
the definition of P) to check that W,(xo.....x:)/q is an algebraic integer. As 
xi(a:) are roots of unity, it will therefore suffice to check that 2292900) is an 
algebraic integer. But this is an obvious consequence of lemma 9.4.18, This 


finally proves the theorem! Q 


Addendum 9.B A Glimpse of Algebraic 
Number Theory 


This addendum recalls the basic properties of number fields. Of course, 
one would need a whole book (and actually much more...) to properly develop 
the theory of number fields, as even proving the basic properties requires a 
lot of commutative algebra. We will try to stay as elementary as possible, 
while still giving some proofs. We warn the reader that a long part of this 
addendum is very abstract. To see the power of the notions and theorems 
discussed, we advise the reader to start with the last part of the addendum, 
which discusses applications to problems with very elementary statements and 
very non-elementary solutions. .. 


9.B.1 Ideals and quotient rings 


Let R be a commutative rmg, An ideal of R is a nonempty subset 7 of R 
which is stable under addition and such that av € J for alla € Randaxeé I, 
Note that this is far stronger than the stability of J under multiplication. It 
is fairly easy to construct ideals of R: if 21, r2,...,2, € R then 


(21, 22,--..%n) = {a1v1 +-+- + anty|a; € R} 


is obviously an ideal, called the ideal generated by 71, .r2,.-.,2n- 

Once we have an ideal J in a ring R, we can naturally construct a quotient 
ring, whose elements are coset classes @ = a+ J with a € R and addition, 
multiplication are defined by @+6 = a+6 and @-b = ab. It is an easy exercise 
to check that it is well-defined (the issue is that we may have @ = a’ even if 
a # a’ and one needs to check that if @ = a! and b = 6, then a+b =a'+¥, 
similarly for multiplication). 


Definition 9.B.1. | 1) An ideal J of R is called maximal if J # R and if f 
is not contained in any ideal different from J and R. 


2) An ideal J of R is called prime if / % Rand ab € R~J for any a,b € R—I. 
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There is a very nice characterization of prime and maximal ideals of a 
ring in terms of quotient rings. The proof is essentially trivial unwinding of 
definitions, but the result is crucial: 


Proposition 9.B.2. 1) An ideal I of R is maximal if and only if R/T is a 
field. 


2) An ideal I of R is prime if and only if R/I has no zero divisors. 


As a field has no zero divisors, this proposition implies that any maximal 
ideal is a prime ideal. There are however prime ideals which are not maximal: 
the ideal (2) in Z[X] is prime and not maximal, as the quotient ring is F2{X1, 
which has no nonzero zero divisors but is not a field. 

There are natural operations on ideals: if I, J are ideals of a ring R, one 
defines their sum J+J = {i+ j|i ¢J,j € J}. It is easy to check that this is 
an ideal. The analogous definition for multiplication would fail {in general) to 
yield an ideal, so one defines the product of ideals J, J as the ideal generated 
by all products ¢-j with (i,j) € I «x J. 


9.B.2 Field extensions 


We say that L is a field extension of K if both K, L are fields and K C L. 
The extension’ L/K is called finite if L is a finite dimensional K-vector space. 
In this case, we define the degree of the extension to be 


[E: K] = dimx(ZL). 


For instance, the extension C/R has degree 2, as 1,7 is a basis of C over R. 
On the other hand, the extension C/Q is infinite (for example, because C 
is uncountable and Q is countable). We will mostly be interested in finite 
extensions, for which the following result is of constant use: 


Proposition 9.B.3. Let L/K and M/L be finite extensions of fields. Then 
M/K is finite and 
[M:K] =[M:Z]-[£: Kj. 


"This notation should not be confused with the quotient ting previously discussed, simply 
because K is not an ideal in Z unless K = L. 
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Proof. One can easily check that if («;); is a basis of M as L-vector space 
and (y;); is a basis of L as K-vector space, then (x:y;)ij is a basis of M as 
-veector space. Oo 


9.B.3 Algebraic numbers and algebraic integers 


If L/K is an extension of fields and if / € L, we say that / is algebraic 
over K if there is a nonzero polynomial f € K[X] such that f(J) = 0. In 
this case, there is a unique monic polynomial 7; € K[X] of least degree which 
vanishes at !, It is called the minimal polynomial of J and it is irreducible in 
K(X], by minimality. The division algorithm shows that the only polynomials 
f € K{X] such that f(J) = 0 are the multiples of 7. Recall that K(Q) is the 
smallest field containing K and | and it can also be described as 

KO) = {Fringe KIX}o #0} = Ki = FOIE € KIX} 
To prove this equality, it suffices to show that if f¢ K{X] does not vanish 
at I, then 70 is of the form A(t) for some A € K[X]. But since f(l) 4 0 
and 7 is irreducible, f and 7 are relatively prime, so there are polynomials 
A, B € K|X] such that Af + Bm = 1. Evaluation at / yields the result. The 
following proposition is easy, but fundamental. 


Proposition 9.B.4. Let L/K be any extension of fields. 


1) Let 1 € L be algebraic over K. Then there is an isomorphism of K- 
algebras® between K(l) and K[X]/(m). Moreover, [K(1) : K] = degm < 
00. 


2) Conversely, if € L and [K(1) : K] < 00, then l is algebraic over K. 


Proof. 1) Consider the map sending f € K[X] to f(J). It is a map of 
K-algebras, vanishing precisely on the ideal (m), by definition of 7. 
It is easy to check that it induces an isomorphism between K[X]/(m) 
and K{l], obtained by sending f + (m) to f(J). Next, if d = deg m, 


“This means an isomorphism of rings which is K-linear. 
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then (the classes of) 1,X,...,X4"! form a K-basis of K(X]/(m) and 
so [K[X]/(m) : K] = d. Combining this with the previous isomorphism 
finishes the proof. 


2) This is clear: if d = [K(2) : K], then 1,1,7,...,J¢ @ K(D) cannot be 
linearly independent over K. This forces a nonzero polynomial equation 
with coefficients in K satisfied by | and so l is algebraic over K. 

CG 


Combining the previous results, we can now prove the following nontrivial 
result (which can also be obtained using the theorem on symmetric polynomi- 
als): 


Theorem 9.B.5. Let L/K be any field extension. The set of elements of L 
which are algebraic over K forms a subfield of L. This subfield is equal to £ 
if L/K is finite. 


Proof. If ly,l2 € L are algebraic over K, then by proposition 9.B.4 K(h)/K 
and K(i))(/2)/K(i) are finite extensions (note that [2 is also algebraic over 
K()). Thus by proposition 9.B.3, K(t)(I2)/K is finite. But K(U)(2) con- 
tains K(ly +12), K (ily) and K(l,/l2). We deduce that if x € {hth hh. h/l}, 
then K(a)/K is finite and the result follows by proposition 9.B.4. The second 
part is also a trivial consequence of proposition 9.B.4. Q 


Definition 9.B.6. 1) A number z € C is called algebraic if it is algebraic 
over Q. It is called an algebraic integer if its minimal polynomial over g 
has integer coefficients. By Gauss’ lemma, this is equivalent. to the fact 
that z is root of some monic polynomial with integer coefficients. 


2) We denote by @ (respectively Z) the set of algebraic numbers (respec- 
tively algebraic integers), 


The following result is an easy consequence of the theorem of symmetric 
polynomials 9.10. 


Theorem 9.B.7. © is an algebraically closed field and Z is a ring. For any 
2 €Q there exists n > 1 such that nz € Z. 
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Proof. The previous theorem shows that @ is a field. Suppose that « € C 
satisfies 2" + a,_12"-! +++» + a9 = 0 for some a; € @. We want to prove 
that z € Q. Let al? be the conjugates of a, (ie. the roots of the minimal 
polynomial of ax, including e,). The theorem of symmetric polynomials easily 
implies that, 


fe TL rea xe ty... tal) 


tl 
Reqs sRn—-2 


has rational coefficients and vanishes at x, from where the result follows. To 
prove that Z is a ring, consider for instance z,y € Z and let #y,...,¢, and 
Yiy 2>---;¥%m be all roots of the minimal polynomials of x and y. Another 
application of the theorem of symmetric polynomials shows that Ih (Xx - 
x; — yj), respectively []; ;(X — 2; - yj) have integer coefficients and vanish at 
2+ y, respectively «-y. Finally, to prove the last statement, take « € Q and 
choose integers a9,a1,...,@, such that ana” + a,-;¢"7! +---+ a9 = 0 and 
Gn # 0. Then 


(@,%)" + Qp—1(ane)”) +--++ agah™? = 0, 


80 Gz LE 


Let us introduce now the main object of this addendum. 


Definition 9.B.8. A number field is a finite extension of Q. If K is a number 
field, then we let Ox = KOZ. 


Ezample 9.B.9. 1) Let d # +1 be a squarefree integer and consider K = 
Q(V4) (as usual, if d < 0, Vd = i/—d). Then K has degree 2 over Q, 
but the structure of Ox depends on the residue class modulo 4 of d: if 
d= 1 (mod 4), then Ox =Z [94], while if d = 2 or 3 mod 4, then 


Ox = Z[Vd]. This is not difficult to prove: imagine that « € Ox and 
write z = a+bV/d. If b = 0, then a must be an integer (since it is rational 
and algebraic integer), so we are done. Otherwise, the conjugates of « 
are ¢ and y = a ~ b/d. Thus 2a and a? — db* must be integers, which 
easily implies the desired result. 
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2) Let G, be a primitive nth root of unity in C and consider K = Q(G,). 
The irreducibility of cyclotomic polynomials (a fairly nontrivial theorem} 
implies that the nth cyclotomic polynomial is the minimal polynomial 
of Cn, so that K has degree y(n) over Q. One can prove with quite a 
lot of effort that Ox = Z{q], ie. there are no algebraic integers in K 
except for the obvious ones. 


9.B.4 Factorization in Ox, the fundamental theorems 


Since the proofs of the theorems stated in this section are rather long and 
technical, we will simply state them without proof, referring the reader to any 
basic number theory book, We prefer to focus on their arithmetic applications. 
Let K be a number field of degree d over Q and let Ox be the subring of K 
consisting of algebraic integers. 


Theorem 9.B.10. If I is a nonzero ideal of Ox, then Ox/T is a finite ring. 


Remark 9.B.11. Actually, one can prove that if [K : Q| = d, then there exist 
#1,%2,...,2q € Ox such that the map Z4 + Ox sending (mj,ng,... Na) to 
ya, +--+» + nav is a bijection. This easily implies the previous theorem: let 
a € I be nonzero, then the norm n of x is again in I and is nonzero, Hence 
J contains nOx. But the previous result shows that Ox /nOx is in bijection 
with Z4/nZ*, which is finite, with |ni4 elements. 


Corollary 9.B.12. If g is a nonzero prime ideal, then Ox /@ 18 @ finite field 
and 80 9 is @ maximal ideal. 


Proof. The ring R = Ox/g is finite with no zero divisors. Let x # 0 be any 


element of R. As R is finite, there must be i < j such that a! 2, ‘Then 
z'(x!~* — 1) = 0 and as there are no zero divisors in R, we obtain z/-? = 1, 
Thus < is a unit. This proves that R is a field and the result follows, O 


Definition 9.B.13, If J is a nonzero ideal of J, we define its norm 
N() = |Ox/J|. 


This is an integer which lies in J, by Lagrange’s theorem in the group Ox/T. 
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We cannot emphasize enough the importance of the following result f 
the theory of algebraic number fields. Suffice it to say that it plays Sicily FA 
same role as the fundamental theorem of arithmetic (i.e. the imi ue faete is 
Wation theorem) and we will leave the reader recall that basicall, x mn = is 
of elementary number theory follow from it. eo) ye 


Theorem 9.B.14, (Kummer, Dedekind) Let K be a number field. 


a) Any ideal of OR, different from 0 and Ox, can be uniquely (up to per- 


meutation) written in the form PL Pgrrees n for some n> 1 and some 
prime ideals g;, not necessarily distinct. ' - 


6) HfL and J are two nonzero ideals of Ox, then N(i-J) = N(Z)- N(J) 
Moreover, Ic J if and only if there exists an ideal J’ such that I = J.J ‘ 


Remark 9.B.15. One can also prove that for all x € Ox we have 
N(zOx) = iN(a)}f. 


Meade Sanpede that i? has degree d over Q and that pis a rational 
7 * a POK = ei ‘py Zio 9’ be the factorization of the ideal pOx. 
8 PUK has norm p* (by the first part of the remark), theorem 9.B.14 yields 


a 7 
p® = N(gi)®..... N(9)&? and ae 
gy ind so there are nonnegative integers f, g 
N(i) = ph. We have the fundamental relation a iain 


eft t+ e2fate + eg fy = [K : Q]. 


ieee sgenree K be a number field, let g be a nenzero prime ideal of 
: K an let © € Ox be prime to @ (ie. gy does not appear in the prime 
eagones of zOx or, equivalently, tO, + @ = Or). Since Ox /@ is a field 
with N(o) elements, we obtain from Lagrange’s theorem the following useful 
analogue of Fermat’s little theorem: ¢N(®)-1 = 4 (mod gy) 


Consider now a finite extension 
C \ ? extension L/K of number fields el 
nonzero prime ideal of Ox. Let : ore eee ee 


POL = {ayx1 +agay +... + GyXn\n > Laz € R, 4; € Or}, 
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which is easily seen to be a nonzero ideal of Or. Moreover, pO, # Ox, as 
otherwise we would have 1 = ayxy + +--+ QnZp for some a; € ~ and some 
z; € Oy and by taking norms, we would get 1 € g. Hence, by the previous 
theorem, there exists a prime 8 of O, dividing @O,. By the same theorem, 
this simply means that @ C 8. Note that 8 is not unique, but there are only 
finitely many possibilities for it. On the other hand, given a nonzero prime 
ideal 8 of Oz, there is a unique prime g of Ox such that oC f. Indeed, the 
existence is obtained by taking = 8NO, (it is an easy exercise for the reader 
to check that this is indeed a nonzero prime ideal of Ox) and the uniqueness 
follows from the fact that any nonzero prime ideal of Ox is maximal (hence, if 
gi C 8 and g C 8 for some different primes gp; Of Ox, then 1 € @) + @2 C B, 
a contradiction). : 


Definition 9.B.18. Let L/K be a finite extension of number fields and let p 
be a nonzero prime ideal of Ox. A prime ideal 6 of Oz is said to tie above g 
if » C B. We also write Bip in this case. 


We can resume the previous discussion by saying that any prime @ of O i 
has a unique prime p of Ox below it, namely 89 Ox and any prime @ of 
Ox has at least one (but finitely many) prime § above it. Note that if Bly, 
then the inclusion Ox C O; induces an injective morphism Ox / > > O1/8, 
realizing therefore O,/8 as a finite extension of Ox /§. 


Definition 9.B.19. Let S/o be as above. 
1) The residual degree of 8/g is defined by f(8/p) = [O,/8 : Ox / gi. 


2) The ramification index of 8/ is the exponent e(8/) of 8 in the prime 
factorization of pO,. 


Note that by definition we have 
pO, = Il pete/e), 
ble 


Using this and proposition 9.B.3, we easily obtain the following useful property 
of the ramification index and residual degree in a tower of number fields. 
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Proposition 9.B.20, Let M/L/K be a tower of finite extensions of number 
fields and let p|B\p be primes of Ou, Or and Ox. Then 


e(o/B)- e(8/9) = e(p/p), f(e/B8)- f(B/p) = fe/@). 


Definition 9.B.21, Let L/K be a finite extension of number fields, A prime 
e of Ox is called 


a) wnramified in L if e(8/p) = 1 for all Blo in L. 


b) totally split in L if e(8/p) = 1 and f(8/@) = 1 for all Slo in L. This 
is equivalent? to: gO, is the product of [L : K] different prime ideals of 
On. 


The following theorem gives a practical way to factor a prime in a number 
field. The precise statement is a bit complicated, but the message is very 
simple: for most primes p, factoring pOx in Ox comes down to factoring 
f € E,[X], where f is the minimal polynomial of any primitive element of K 
that lives in Ox. 


Theorem 9.B.22. (Dedekind, Kummer} Let K = Q(x) be a number field, 
where x € Ox has minimal polynomial f. Let p be @ prime which does not 
divide! |Ox /Z{x\| and let 


be the prime factorization of f € F,[X]. Then 


g 
pOn =[[ of, 


ist 


where 9: = pOx + fi(x)Ox are different prime ideals and N(@;) = p®®*. 
°We have > sip (8/ p)-. 78 /@) = {L: K}, by an argument similar to that used in remark 
9.B.16, 
‘Note that the result stated in remark 9.B.11 shows that Ox /Z|z] is a finite set. 


9.B. A Glimpse of Algebraic Number Theory 465 


Proof. Lift arbitrarily fj to some monic polynomials f; € ZX]. The key 
point is to prove that the natural map Z[X1/(p, f,) + Zle]/(p, fi(a)) sending 
(the class of) h to (the class of) A(x) is an isomorphism of rings. Assume 
for a moment that we proved this. Let @ = |Ox/Z[z]|, so ged(p,a) = 1 
and aOx C Zig], A standard argument using Bézout’s lemma shows that 
Ox = pO + Zz], thus Ox/@; is naturally isomorphic to Z[z]/(p, fi{x)). 
p*s 4 elements, it follows from the previous discussion that the g; are different 
prime ideals with N(g;) = pt. Since of C pOx + fila)"Ox, f(x) = 0 
and f = []%., ff (mod p2Z[X)}), it follows that []%_, of ¢ pOx and so by 
theorem 9.B.14 we have pOx| []{, f'. So, if pOx = []%_, of, we must have 
8; < e;. We conclude by noting that both sums }7, e;-deg f; and 37, s;-deg fi 
are equal to [K : Q] (see remark 9.B.16), so we must have s; = e; for all 2. 
Let us prove now that Z[X]/(p, f:) > Ziz|/(p, fi(w)) is bijective. Since 
surjectivity is clear, it remains to prove the following assertion: if h € Z[X] 
satisfies h(x) € p&lax] + fi(x)Zia}, then h € pZ[X) + fi(X)- Z[X]. Write 
h(x) = pA(x) + fi(z) B(x) for some A,B € Z[X]. Since f is the minimal 
polynomial of «, there is r € ZLX] such that h = pA+ fi:Birf. It suffices to 
use again that f € pZ[X] + f;- Z{X] to finish the proof. 


Remark 9.B.23. It is not very difficult to prove that if p does not divide the 
discriminant of f, then p does not divide |Ox/Z{z]}, so the theorem can be 
applied. 


9.B.5 Two classical examples 


Consider a squarefree integer d # +1 and let K = Q(./d). We saw that 
Ox = Zia}, where x = iva if d = 1 (mod 4) and 2 = V/d otherwise. The 
minimal polynomial of x is X? ~ X + i in the first. case and X? — d in the 
second case. Theorem. 9.B.22 shows that in order to understand the prime 
factorization of pOx, we need to understand the prime factorization of these 
polynomials modulo p. Since a quadratic polynomial modulo p is irreducible 
if and only if it has a root in F,, we easily deduce the following 
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Proposition 9.B.24, Let d 4 +1 be a squarefree integer and let K = Q(Vd). 
Let p be a prime. 


a) Ifp > 2, then pOx is a prime ideal if (4) =: ~1, @ product of two 
different prime ideals if (4) = 1 and the square of a prime ideal if pid. 


b) Ifp = 2, then pOx is the square of a prime ideal if d = 2,6,3,7 (mod 8), 
a prime ideal ifd = 5 (mod 8) and a product of two different prime ideals 
if d = 1 (mod 8). 


Consider now n > 1 and let K = Q(G,), where ¢, is a primitive nth 
root of unity. As we have already said, it is rather difficult to prove that 
Ox = Z¢,) and we will take this for granted (actually, it is easier to use 
remark 9.B.23). In this case, theorem 9.B.22 reduces the prime factorization 
of pOx to that of dp € F,[X], where @_ is the nth cyclotomic polynomial 
(whose roots are precisely the primitive nth reots of unity). Assume that p 
does not divide n and let g be an irreducible factor of degree f of én € Fp[X]. 
Let x be S Tog of g in an algebraic closure of Fy. Theorem 9.A.4 shows that 
positive power ve « equal to 1 (because x is a root of ¢,,), it follows that the 
sequence 2?" is periodic with period d, so we must have f = d. That is, dp 
factors mod p as a product of irreducible polynomials of degree d, the order of 
pmodule n. Since X" —1 is squarefree modulo p (because the hypothesis that 
p does not divide n implies that X” — 1 is relatively prime to its derivative), 
we deduce that in theorem 9.B.22 we have e; = land fp=d 
for all i, so g = 29) Assume now that pin and let n = Me m for some m 
relatively prime Bi p. it is easy to see that Gn(X) = dm(X e \idm(X ot *), so 
modulo p we have ¢, = ee —e , This reduces the problem of factoring pOx« 
to the previous case, All in all, we have the following useful result: 


€g = +++ = €, 


Proposition 9.B.25. Let n be an integer greater than 1, let K = Q(G,) and 
let p be a prime. 


(2) len F 
aa ned) different prime 


a) If ged(n,p) = 1, then pOx is a product of 
ideals, each of degree ord(p mod n). 
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6) Ifp|n andn = p*-m with gcd(m, p) = 1, then pOx = (p10 --§)PerP 


where s= aes and each go; is of degree ord(p mod m). 


9.B.6 The primitive element theorem and embeddings of 
number fields 


When working with subfields of C, the following result is very handy. We 
will use it constantly to shorten proofs of results which actually hold in much 
greater generality. 


Theorem 9.B.26. (primitive element theorem) Let L/K be a finite extension 
of subfields of C. Then there exists 1 € L such that L = K(I), 


Proof. As L/K is finite, there are elements ¢),...,2, € L such that L = 
K(x1,22,...,0,) (for instance, the elements of a K-basis of L over A). Thus, 
by induction on n it is enough to prove that if z,y are algebraic over K’, then 
there exists € £ = K (x,y) such that Z = K(D. 

Let. f,g € K[X] be the minimal polynomials of «x, y respectively and let 
1 = 2, %2,...,4n and yi = ¥,¥2,.--.Y¥m be their roots. Clearly, there exists 
c © & such that x; + cy; # x + cy for all (é,7) 4 (1,1) (each of the previous 
linear equations in c has at most one solution in K and K is infinite). We 
claim that 1 = « + cy works. Clearly K(J) ¢ L, so it is enough to check 
that z,y € K(). As l = 2 + cy, it is enough to do it for y. Now, we 
know f(! ~ cy) = 0, so the polynomial f(U ~ eX) € K(J[X] has a common 
root y with g. But by construction this is the only common root of these 
polynomials. Moreover, it is a simple root, as g is irreducible over Q, so it has 
simple roots. Finally, we conclude that the greatest common divisor of these 
two polynomials is X ~ y. As these two polynomials have coefficients in K(0), 
so does their greatest common divisor and so y € K(l). The result follows. O 


We will use the primitive element theorem to prove some basic results on 
the structure of number fields. The first one concerns the embeddings of a 
number field in C. Note that Q(V2) has two such embeddings, namely the 
identity map and a + by2 4 a ~ 6/2. It turns out that a number field with 
degree d has exactly d embeddings in C. 
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Theorem 9.B.27. Let L/K be an extension of number fields (L/K is au- 
tomatically finite). Then any embedding K -+ C extends to exactly [L : Ki 
embeddings L > C. 


Proof. Fix an embedding o : K — C and use the primitive element theorem 
to write L = K(l) for some 1 @ L. Then 1 is algebraic over K’, of degree 
d= {L: K), with minimal polynomial f € K(X]. Suppose that o: 1 3C 
is an embedding that extends o, so o’(x) = o{z) for allz &€ K. Thus, if 
g € KIX], then o’(g(1)) = g?(o’(1)) (where g’ is the polynomial obtained from 
g by applying ¢ to its coefficients) and so o” is determined by o'(), which has 
to be a root of f% (use the previous equality with g = f). Conversely, if [is 
a root of f%, we can define an embedding o'(g(/)) = g*(U’) for all g © K[X], 
well-defined because any two polynomials g and h with g(1) = h(l) differ by a 
multiple of f. Thus the embeddings of L into C that extend o are in bijection 
with roots of f? and the result follows. CQ 


Taking K = Q in the previous theorem, we deduce that any number field 
L of degree d over Q embeds in exactly d ways in C. 


9.B.7 A bit of Galois theory 


Let L,M be two extensions of a field K and suppose for simplicity that 
they are contained in C. A K-morphism L -> M is a K-linear map from L to 
M which is also a ring homomorphism. Stated differently (but equivalently), 
it is an additive and multiplicative map f : L -» M such that f(2) = x for all 
eek. 


Definition 9.B.28. Let f € K{X]. The splitting field of f is the field 
K(z1,22,...;0n), where z; € C are the roots of f. 


Note that a K-morphism f : K(#i,...,a) > M (where M/K is an 
extension) is uniquely determined by the values f(a;), as any element of 
K(a1,...,;2n) is a polynomial with coefficients in K in the 2;’s. However, 
f(«;) cannot be any element of M, since if P € K[X] kills 2;, then P also 
kills f(z): note that P(f(#;)) = f(P(2i)) = 0, as f is K-linear and multi- 
plicative. Moreover, there might be algebraic relations with coefficients in K 
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between the a;’s and the same argument shows that the numbers f(.;) must 
satisfy these relations. Thus, it is a fairly delicate issue to understand these 
K-morphisms. This is the content of Galois theory. Let us make an important 
definition first: 


Definition 9.B.29. Let L be an extension of a field K. The Galois group of 
EL over K, denoted Gal(L/K) is the set of bijective K-morphisms f : L + L. 


We can now prove one basic result of Galois theory, which is also of 
constant use: 


Theorem 9.B.30, Let L/K be an extension of number fields. Then 
IGal(L/K}| < [L: K}, 
with equality if and only if L is the splitting field of some polynomial f ¢ K[X]. 


Proof. Let L = K(1) for some 1 € L. Each element o € Gal(L/K) is uniquely 
determined by o(), which must be a conjugate of /. Thus we have a natural 
injection of Gal(L/K) in the set of conjugates of 1, which has {L : K] elements. 
The inequality follows. 

Suppose that we have equality and let f €¢ K[X] be the minimal poly- 
nomial of 1. We will prove that D is the splitting field of f. It is enough 
to prove that if x is a conjugate of 1, then « € L. But the first paragraph 
and the equality case implies that if l,,...,/, are the conjugates of 1, then 
{o(Dlo € Gal({L/K)} = {h,...,t,}. Thus there exists ¢ € Gal(L/K) such 
that x = o(l). Since o(L) CL, we have x € LE and we are done. 

Conversely, suppose that L is the splitting field of some f € K[X], with 
roots £1,£2,...,2,n. Theorem 9.B.27 applied to the natural inclusion map 
K — C shows that there are [£ : K] K-linear morphisms of rings L > C. 
But for any such morphism o : L -+ C we have o(z;) € L, as o(a;) is just 
some root of f and Z contains all these roots. Thus the image of any such 
morphism is a subset of L, ie. any such morphism is an element of Gal(L/K). 
The result follows. im 


Definition 9.B.31. We say that a finite extension L/K of number fields is 
Galois if |Gal(Z/K)} = [L : K], or, equivalently, if Z is the splitting field of a 
polynomial with coefficients in K. 
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We are now able to prove the main theorem of Galois theory for number 
fields. The result holds in much greater generality, but we will not need it 
and we prefer to use all the extra data in order to shorten the proof rather 
dramatically. 


Theorem 9.B.32. Let L/K be a finite Galois extension of number fields. 
Sending a subfield M of L containing K to Gal(L/M) yields a bijection between 
subfields of L which contain K and subgroups of Gal(L/K). The inverse of 
this bijection is the map sending the subgroup H of Gal(L/K) to 


L! = {2 € Llo(x) = 2,Vo € HH}. 
Moreover, we have Hy C Ho if and only if L™ contains L¥2. 


Proof. Let us prove first that LS*(Z/M) — jy for any intermediate field AM 
between L and K. Note that L/M is again finite and Galois (as L is the 
splitting field of a polynomial with coefficients in K, thus also in M ). Write 
L = M(1) for some primitive element | ¢ L, with conjugates f1,l2,...,1, over 
M. Suppose that « ¢ LS“/* is not in M. Thus, we can find f € M[X] 
nonconstant of degree less than n and such that x = f(1). We saw in the proof 
of the previous theorem that {o(I)|¢ € Gal(L/M)} = {h,..-.dn}. So, for any 
a we can find o; € Gal(L/M) such that oj(1) = 4. Then x = o; (2) = f(l). 
Hence f(li) = f(l) = --- = f(In), contradicting the fact that f is nonconstant 
of degree less than n. The result; follows. 

Next, we need to prove that if H is any subgroup of Gal(L/K), then 
Gal(L/L") = H. By the very definition of L” we have an inclusion H C 
Gal(L/L). It is thus enough to check that |Gal(L/L")| < |] and, using the 
previous theorem, it is enough to check that (ZL: L”] < |H]. Write L = L# iG) 
for some 1 € L. Consider f(X) = T],24(X ~ a(t). This is a polynomial of 
degree |#| whose coefficients are clearly in L”, Hence | has degree at most 
|H| over L* and the result follows. 

It remains to check that Hy, © Hg if and only if L” contains L#. It is 
clear that if Hy C Ho, then L"* contains L"? (any element of L fixed by Hy 
is also fixed by Hj). Assume that L contains L42, so E41 9 #2 = Le, 
But it is clear that the left-hand side of this equality is simply L”, where H 
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is the subgroup of Gal(L/K) generated by Hy and H2. Hence L® = L# and, 
as we have seen, this forces H = Ho and so Hy C He. o 


Remark 9.B.33. Using similar arguments, it is not difficult to show that H is 
normal in Gal{L/K) (i.e. gHg7! = H for any g € Gal(L/K)) if and only if 
L*/K is Galois. 


9.B.8 Prime factorization in a Galois extension 


The results in this section will be crucially used in the applications that 
will be presented at the end of this addendum. They are absolutely funda- 
mental in algebraic number theory. 


Theorem 9.B.34. Let L/K be a Galois extension of number fields, with 
G = Gal(L/K). Let @ be a nonzero prime ideal of Ox and let 8; and Bo be 
two prime ideals of Or which lie over @. Then there exists o € G such that"! 
Bg = o( 81). 


Proof. Suppose that 82. 4 o(8;) for allo € G. Using the general version!” 
of the Chinese Remainder Theorem, we obtain the existence of a € S2 such 
that a ¢ o(4;) for allo € G. Then [],-¢¢(@) € Ox 9 So = y C fy and this 
contradicts the fact that 4) is a prime ideal and a ¢ o71(4;) for any o € G. 
The result follows. oO 


Corollary 9.B.35. Let L/K be a Galois extension of number fields and let 
g@ be a nonzero prime ideal of Ox. Then all primes above p have the same 
ramification index and the same residual degree. 


Proof. For the residual degrees, note that any a € G induces a bijection be- 
tween O; /8 and Ox /o(/3), so these two sets have the same number of elements. - 
We conclude by the previous theorem. Next, if ¢ is a positive integer such that 


‘Note that by definition o(81)} is the set of all o(x), with x & A;. It is easy to check that 
this is again a prime ideal of Oz and that o(:) NOx = 9. 

“This is stated as follows: let A be a commutative ring and let Ii, Iz,...,1n be ideals 
such that I; + I; = A for alli # 7 (which is satisfied if J; are different maximal ideals of A, 
for instance). Then for any 7,...,@n € A there exists x € A such that x— a; € fj for alll i. 
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B® divides gOz, then o(8)© divides o(~O,) = gOz. The result follows again 
easily from the previous theorem. a 


Definition 9.B.36. Let L/K be a Galois extension of number fields, with 
Galois group G. Let 8 be a prime of O,. The decomposition group of 7 is 


Dg = {o € Glo(8) = A}. 


Note that Dg is indeed a group and that any ¢ € Dg induces an auto- 
morphism o : O,/8 > O,/8, which is trivial on Ox/p for 9 = 8M Ox. 
Hence, we have a natural map Dg -> Gal((O,/9)/(Ox/p)). Note that 
Gal((Oz,/8)/(Ox/@)) is a eyclic group, generated by the automorphism 
a -+ M0), since (O;/8)/(Ox/) is a finite extension of finite fields (see 
the addendum 9.A for the structure of finite fields). The following result. is 
rather tricky, but fundamental. 


Theorem 9.B.37. With the previous notations, the map 


Dg ~> Gal((O1/8)/(Ox/9)) 


is surjective. 


Proof. We may assume that O,/8 #4 Ox/g, as otherwise the statement is 
trivial. Let a be a generator of the cyclic group (O,/8)*, so that clearly 
0,/8 = (Ox«/p)la}. Using the general form of the Chinese Remainder The- 
orem (see the proof of theorem 9.B.34), we can find @ € Oy; such that a 
(mod 8) = a and a € § for any 8 # 8 above ge. Let F be the minimal 
polynomial of a over K. Then F € Ox|X] and F(a) = 0, hence F(a) = 0 
in O/B. But then! F(a\()) = 0, so that F(a%®)) € 8. So, we can find a 
conjugate 6 of a so that b-— a%(@ < 8. It is not difficult! to see that there 


f{X)*, so if @ is a root of f in some extension of F,, then a? is also a root of f. 

MSince L/K is Galois, L contains all conjugates of a over K, i.e, all roots of F. Let Mf 
be the field generated over K by these conjugates, i.c. the splitting field of K. Then M/K 
is a Galois subextension of L/K, thus any element of Gal(M/K) extends to an element of 
Gal(L/K). But the isomorphisms K[X]/{F) > M sending X to a and 6 respectively yield 
o@ € Gal(M/K) such that o{a) = 6. 
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exists ¢ € Gal(L/K) such that o(a) = b. We claim that ¢ is in Dg and that 
it induces the automorphism « + 2™(*) of O,/8. First, if o(8) # B, then 
a1(8) # and so by our choice of « we have a € o71(8), ie. o(a) € 8. But 
o(a) — aX) © 8, forcing aN) @ 8 and then a € 8, a contradiction with a 
(mod 8) = a #0, Thus o € Dg. The automorphism induced by ¢ on O,,/8 
is uniquely determined by its action on a (generator of (O;/8)*) and by our 
choice this action is a), The result follows. im) 


Let us fix a prime 8 above g in L. By theorem 9.B.34, all primes above p 
are of the form o(f) with o € Gal(L/K), ie. Gal({L/K) permutes transitively 
the different primes £1,...,@, over g. Since there are precisely |Dg| elements 
of Gal(L/K) which fix 8, we obtain {L: K] = |Gal(Z/K)| = g-|Dg]. But we 
also have [L : K] = e(8/)-f(8/)-g, since all ramification indices and residual 
degrees of the §;’s are the same. Hence |Dg| = e(8/@)- f(8/@). In particular, 
if g is unramified in L, ie. if e(8/p) = 1, then |Dg| = f(8/@), which is 
the same as the cardinality of Gal((O,/8)/(Ox/@)). We obtain therefore the 
following crucial result: 


Theorem 9.B.38. Let L/K be a Galois extension of number fields and let 
be a prime of Ox unramified in O;. Let B\p be a prime of Or over g. Then 
the map Dg ~+ Gal((O,/8)/(Ox/@)) és a bijection. In particular, there exists 
a unique (8,L/K) € Dg such that (8,L/K)(x) = x) for alla € O,. We 
call (8, L/K) the Frobenius substitution of 8 in L/K. 


Remark 9.B.39. Assume that we are only given y. The choice of some 8 over 9 
yields a Frobenius substitution. (6, L/K), but this depends on the choice of 8. 
However, any other prime above g is of the form o(8) for some o € Gal(L/K) 
and it is immediate to check that (o(@),L/K) = 0 0 (@,L/K)oa7} (indeed, 
the right-hand side is in D,,g), which is trivially equal to cDgo™1, and satisfies 
the congruence that uniquely characterizes it). Hence, the conjugacy class of 
(8, L/K) in Gal(L/K) does not depend on the choice of 8 and is called the 
Frobenius conjugacy class of g. Note that if Gal(Z/K) is abelian, then this 
conjugacy class is reduced to an element, which we denote (g,L/K): it is 
(8,L/K) for any 8 over g. 


We leave to the reader to check the following easy results: 
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Proposition 9.B.40. Let M/L/K be a tower of Galois extensions of number 
fields and let plBip be a tower of prime ideals. Assume that p is wnramified 
‘in M (thus B is unramified in M and ge is unramified in L). Then we have 
the following relations. 


a) (p,M/L) = (p, M/K)i(8/), Actually, this relation still holds if M/K is 
not necessarily Galois, 
= 


8) (a, M/K)lp = (8,£/8). 


9.B.9 Bauer’s theorem and Chebotarev’s density theorem 


Before stating the following fundamental result, we need one more defini- 
tion. If S is a set of primes, its Dirichlet density is 


d(S) = lim = 


sod 


if the limit exists. The next very deep theorem was conjectured (and proved 
in some special cases) by Frobenius. It is a vast generalization of Dirichlet’s 
theorem. 


Theorem 9.B.41. (Chebotarev) Let K be a finite Galois extension of Q of 
degree n and let g € Gal(K/Q). Let S be the set eae such that (gp, K/Q) 
is conjugate to g for all prime divisors g of p. Then d(S) = 4 A{hgh lh € G}}. 


Here is a typical application of this deep theorem. 


Example 9.B.42. Let u ibe a ie ime and consider the set S of those primes p = 1 
(mod 1) such that 2°7 = (mod p). These twe conditions are equivalent 
to the statement that xt = ir splits into distinct linear factors in FIX] (as 
2 =1 (mod p) is equivalent to the existence of y € F, such that y! = 2). 
This is also equivalent to p being a product of different primes in Ox, where 
K = Qe MF 9) ) is the splitting es of X'— 2. Using again Chebotarev’s 
theorem, we deduce that d(S) = a But K contains Q(e** ) and Q(/2), 
which have degrees | ~ 1, respectively 1. Thus [K : Q| is a multiple of [(/ — 1) 


and so necessarily [K : Q| = I(— 1) and d(S) = ney iy 
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We will also need the following easy consequence of Chebotarev’s density 
theorem: 


Proposition 9.B.43. Let o € Gal({L/K). There are infinitely many prime 
ideals g of degree 1 of K such that one can find Ale in L with (8,L/K) =o. 


Proof. Take a Galois extension E over Q which contains L. Since L/K is 
Galois, we can extend o to an automorphism again denoted o of Gal(E/K). 
By Chebotarev, we can find infinitely many primes p such that there is plp 
in £ with (p,£/Q) =o. Then D, C Gal{E/K)}, so p = pK has degree 1 
(lemma 9.B.49). Let 8 = pM O,. Then 


(8, £/B) = (9, E/K)\p = (9, B/Q)i, = 


and we are done. o 


Definition 9.B.44. If K is a number field, let P)(K) be the set of rational 
primes p for which pOx has at least one prime ideal factor g of residual degree 
1 (ie. such that Ox/@ is the field with p elements). 

The following result explains why P,(K) is an interesting object: 


Proposition 9.B.45. Let f € Z[X] be @ monic irreducible polynomial, let @ 
be a root of f and let K = Q(@). Then there is ¢ such that 


P\(K) M [e,00) = {p > clda € Zsuch that pif (x)}. 


Proof. If p is a sufficiently large prime, then the residual degrees of the prime 
divisors of pOx are given by the degrees of the irreducible factors of f € F(X], 

by theorem 9.B.14, Hence, for such p we have p € P,(K) if and only if f a a 
lear factor in F,[X], i.e. if and only if there exists ¢ € Z such that p{ f(z). 


Here is a rather nice application of this proposition: 


'' Theorem 9.B.46. (Nagell) If f € Z[X], let P(f) be the set of primes 


p for which the congruence f(x) = 0 (mod p) has at least one solution. 
If fi, fa,-.., fy are nonconstant polynomials with integer coefficients, then 
Pf) A P(fe) N+ OP Ufe) is infinite. 
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Proof. The case k = 1 is a classical result due to Schur, but for the reader’s 
convenience let us recall the proof. If f;(0) = 0, everything is clear. Consider 
the numbers 2, = ae ae We have |z,| > 1 and z, = 1 (mod nJ) if n is 
large enough, so there exists pplz, with p, > n and the result follows. 

The general case is more difficult. We may assume that f; are irreducible 
monic. Let 2; be roots of f; and let K; = Q(z;). Let K be the least number 
field containing all K;’s and write K = Q(x) for some z € Ox with minimal 
polynomial f. Applying the case k = 1 to f and using the previous proposition, 
we see that we can find infinitely many p which have a prime of degree 1 in 
K. Let 8; = pM Ox;. Since f(p/p) = 1, we also have f(6;/p) = 1. Applying 
once more theorem 9.B.22 (this time for each K;) we see that all but finitely 
many primes p (among the infinitely many we have just found) belong to 
nk_,P(fi), which is therefore infinite. go 


The following beautiful theorem due to Bauer was discovered before Cheb- 
otarev proved his theorem. It is however more conceptual nowadays to see 
Bauer's theorem as a consequence of Chebotarev’s density theorem, which is 
what we will do. 


4 Theorem 9.B.47. (Bauer) Let Ky be a number field and let Kz be a number 
field which is Galois over Q. Suppose that there exists a set of primes S of 
Dirichlet density 0, with the following property: if p @ S is in P,(Ky), then p 
is completely split in Ko. Then Ko C Ky. 


Proof. We start with two easy lemmas, which express the condition p € P;(K) 
in a group-theoretic way. They are also results of independent interest and 
will be used in other applications. 


Lemma 9.B.48. Let L be a number field which is Galois over Q, with group 
G. Lett QCK CL be a subfield and let H = Gal(L/K). Suppose that Blolp 
is @ chain of prime ideals in L,K,Q such that p is unramified in L. Then 
f(@/p) = 1 of and only if Dg CH. 


Proof, Note that the decomposition group of 6 with respect to the extension 
L/K is simply Dg mH. But since p is unramified in L, g is unramified in 
L/K and so the decomposition group of 8 with respect to L/K has f(8/) 
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elements. That is, |H © Dg] = f(8/). But this equals also £4/2) — [Dal 
The result follows. fey) Tere 


Lemma 9.B.49. Assume that L and K are as in the previous lemma and let 
p be a prime which is unramified in L. Then 


1) p has a prime factor @ in K with f(p/p) == 1 if and only if there is Blp 
in L such that Dg CH. 


2) p is completely split in K if and only if for all Blp in L, we have Dg cd. 


Proof. 1) is an immediate consequence of the previous lemma. For 2), it is 
clear that if p is completely split in K, then for any Sip we have Dg CH (asif 
g = BOOK, then necessarily f(y/p) = 1). Conversely, if Dg CH for any Alp, 
then the previous lemma. implies that f(o/p) = 1 for any lp in K. But note 
that p is unramified in K, as it is already in L. Thus p must be completely 
split in K. o 


Let us prove now the theorem. Choose! a finite Galois extension L of 
Q containing K, and Ka and let Hj = Gal(Z/K;). By the main theorem of 
Galois theory (theorem 9.B.32), it is enough to prove that A, © He. Choose 
any o € Hy. By Chebotarev’s theorem, there is p ¢ S such that p has a prime 
factor 8 in L with Dg = (¢) (cyclic subgroup of Hy generated by o). By 
lemma 9.B.49, we have p € Pi(K) and so p is totally split in K2, which forces, 
by the same lemma, Dg © Ha. Hence o € Hy and we are done. 0 


Remark 9.B.50. Here is another useful consequence of lemma 9.B.49. Let 
n > 1 be an integer and let H be a subgroup of (Z/nZ)". We claim that a 
prime p not dividing n is totally split in Q(¢,)” if and only if p (mod n) € H. 
Indeed, such a prime is unramified in Q(G,) and by lemma 9.B.49 it is totally 
split in Q(¢,)” if and only if for any lp in Q(Gn) we have Dg C H. Since 
Gal(Q(Gn)/Q) is abelian, this is also equivalent to (p, Q(¢,)/Q) © H, which 
is saying that p (mod n) € H (we naturally identified H with a subgroup of 
Gal(Q(¢n)/Q)). 


For instance, if Ky = Q(z) and Ke = Q(y), consider the extension generated over Q by 
all conjugates of x and y. 
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9.B.10 Finally, a reward: applications to 
“elementary-looking” problems 


In this section we show how the deep results in the previous sections 
can be combined to yield some fairly nontrivial theorems with “elementary” 
aspects. The first one uses a generalization of Bertrand’s postulate to obtain 
the following difficult irreducibility result. 


T Theorem 9.B.51. (Schur) If n > 2 and @1,@2,-..,Gn—1 are integers, then 
the polynomial 


2 


yt tae Se tax +1 


is irreducible in Q. 


Proof. It is enough to prove that f =X" +n@p\X*) 4... +n is irreducible 
in Z[X]. Suppose that this is not the case and let g be a monic irreducible 
factor of f of degree m, with m < §. Choose any prime divisor p of n(n ~ 
1)--+-- (m ~ m+ 1) and consider the reduction f of mod p. The choice of p 
implies that X"-+1 divides f. If f = 9-h, it follows that XP-M4+1 divides 
gh and since deg h =n —m, we must have X |G and so pig(0). 

Let z be a root of g and let K = Q/z], a number field of degree m. Since 
p\g(0), it follows that p divides N(z-Ox) and so we can choose a prime ¢ of 
Ox dividing zOx and such that olp. Let e = e(p/p) < [K: Q] = m be the 
ramification index of @. Since f(z) = 0, we obtain (set aj, = 1) 


! 
, 
up(n!) > min vp laze zt), 
ipl )2 min ip | a 7 


so we can find 7 such that 


Up(t!) > ivy(z) = i 


Using the inequalities Uplil) < et and e < m, we deduce that p < m. 
We have therefore proved that all prime factors of n(n—1)-+--- (n—~m+1) 
are Jess than or equal to m. This contradicts a famous theorem of Sylvester 
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and Schur, which generalizes Bertrand’s postulate and for a proof of which we 
refer the reader to [26]. o 


FTheorem 9.B.52. ifn > 2k, then ) has a prime divisor greater than k. 
Let us consider now the second application. 


Theorem 9.B.53. (Davenport, Lewis, Schinzel) Let f € QIX] be a potyno- 
mial with the following property: any arithmetic progression P C Z contains 
an integer x such that f(x) is the sum of squares of two rational numbers. 
Then there exist polynomials g,h € QLX’ ] such that f = 9? + h?. 


Proof. We may assume that f is not constant. Using Gauss’ lemma, we can 
write f =cfyl--+-- mm" for some ¢ € Q*, some primitive, distinct, irreducible 
polynomials f; € Z[X] and some e, € N*. Fix an index Jj for which e; is an 
odd number and let # be a root of f;- We claim that L = Q(@) contains Qt). 
Using part a) of Bauer’s theorem 9.B.47, it is enough to prove that if ¢ is 
a sufficiently large prime in P,(Q(9)), then q € P, (Q(i)). We will need the 
following standard argument: 


Lemma 9.B.54, There exists a prime qo with the following property: for all 
q € P(L) 1M [qo,00), one can find an integer x such that ug fi(z)) = 1 and 
Uq( fe(x)) = 0 for all k £7. 


Proof. First, choose a prime q; > {Oz : Z[@]]. Take a prime number qe greater 
than all prime factors of (the numerator or denominator of ) ¢ and such that 
gq does not divide ged(f;(x), fila) Nags fx(x)) for any x € Z and any q > qo. 
To prove the existence of go, note that f; is relatively prime to fe Mex; tr, 
write a Bézout relation over Q for f; and fj ‘TI key Fi, and clear denominators. 
We claim that gg = max(q,q2) works. Indeed, let q € P; (LZ) 9 {go, 00]. By 
Dedekind-Kummer’s theorem 9.B.22, we can find y © Z such that ¢ divides 
f;(y). Then q does not divide Fi(y) or Thee; f(y), So one of y or y+ satisfies 
the desired conditions. Oo 


Now, fix qo as in the previous lemma and let q € Pi(L) 9 [qo, oe) and 
x as in the lemma. By hypothesis we can find y = x (mod q?) such that 
f(y) = a? + 8 for some rational numbers a,6. Note that vel fj(y)) = 1 and 
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vg(fa(y)) = 0 for k # j. Then ej = ug(f(y)) = vg(a? + 6?) is odd by our 
choice of y. We deduce that g = 1 (mod 4) and so q is split in Q(é), hence 
q € P,(Q(d)). This proves the claim made in the previous paragraph and we 
conclude that i= I € L. Hence we can write i = h(@) for some h € Q[X]. 
Then f; must divide h? + 1. If G = ged(h — i, fj) € Q(2)[X], it is easy to see 
that Ngyyq(G) = ged(h — 4, fj) - ged(h + @, fj) is a polynomial with rational 
coefficients dividing f;. We deduce that f; is a constant times Ngqy/o(G), 
and this last polynomial is obviously the sum of squares of two polynomials 
with rational coefficients. 

Applying the previous arguments for each f; such that e; is odd, we 
deduce that f is of the form e(u? + v*) for a constant ¢ and some u,v € QLX]. 
But. using once more the hypothesis on ¢ we deduce that c is the sum of squares 
of two rational numbers and the result follows. D 


Remark 9.B.55. Actually, Davenport, Lewis and Schinzel prove a more general 
result: let A/Q be a Galois extension of degree n and let v1, v2,...,Un be 
such that Ox = Zo, + Zvg +---+ Zen. Suppose that f € QLX] has the 
property that any arithmetic progression of integers contains an integer x for 
which the equation f(z) = Nxyg(uivi + uote + +++ + UnUn) is solvable in 
(ui, U2,...,%n) € Q”. If either Gal(K/Q) is a cyclic group or the multiplicity 
of any zero of f is prime to n, then one can find uw, ug,-.-,Un € Q[X] such 
that 


F(X) = Nigga (ta (X), u2(X), .-. tn(X)). 
The proof follows precisely the same ideas, but is a bit more technical. 


WY Theorem 9.B.56. (Schinzel) Let f € ZX] be an irreducible polynomial over 
Q and let n be an integer greater than 1. Suppose that there is e with the 
following property: if p > c divides f(x) for some x € Z, then p (mod 7). 
Then there are a € Q and « polynomial g € Q(G,)[X] such that 


£(X) = a: Nagg,ye(g(X)). 


Proof. Let @ be a root of f an let K = Q(@). We claim that Q(¢,) ¢ K. By 
Bauer’s theorem, it is enough to prove that if p is a large prime having a factor 
of degree 1 in K, then p is completely split in Q{¢,). But if p is such a prime, 
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greater than c and n, then p divides some f(«), so p = 1 (mod n) and p is 
completely split in Q(¢,)- 

Now, write ¢, = u(@) for some u € Q[X], Then, since f is irreducible and 
¢n(u(P)) = 0, the polynomial f divides d, o u in Q{X] (recall that ¢, is the 
nth cyclotomic polynomial, minimal polynomial of ¢, over Q). For j relatively 
prime to n, define fj = ged(f,u —¢h) € Q(¢n)[X]. Let oj € Gal(Q(¢,)/Q) be 
the automorphism sending ¢, to ¢j,. Then clearly f; = o(f,), so 


IL f= Necavelh- 


ged(j,n j=] 


Clearly, the polynomials f; are pairwise relatively prime, so their product 
divides f. But, as we have seen, their product has rational coefficients, hence 
by irreducibility of f we must have f = a- Nawe,.)/a( fi) for some constant a. 
The result follows. im) 


Remark 9,B.57. It is much easier to check that the converse of the theorem 
also holds. 


WIheorem 9.B.58. (Murty) Let f € ZIX] be a polynomial and let ln be 
relatively prime positive integers. Let S be the set of primes p for which the 
congruence f(x) = 0 (mod p) has solutions in Z. Suppose that there exists c 
such that any p © S greater than c satisfies p= 1 (mod n) or p= 1 (mod n). 
Also, suppose that there are infinitely many primes p = 1 (mod n) in S. Then 
P=1 (mod n). 


Proof. Let 0 be a root of f and let K = Q(@) and L = K(¢,), so L/K isa 
finite Galois extension (splitting field of X" ~ 1). Let H be the subgroup of 
Gal(Q(¢,)/Q) generated by the automorphism o), which sends ¢,, to él. Note 
that the hypothesis combined with Bauer’s theorem and with remark 9.B.50 
yield an inclusion Af C K, where M = Q(G,)?. Here is the crucial technical 
result: 


Lemma 9.B.59. Restriction to Q(¢,) yields an isomorphism between 
Gal(L/K) and H. 
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Proof. Since M C K, the restriction of any o € Gal(L/K) yields an element 
of H = Gal(Q(¢,)/M). It is clear that if the restriction of o is trivial, then 
is trivial, as L = K(¢,). Surjectivity is more delicate. Choose a prime p'= | 
(mod 7) which is unramified in L (this excludes only finitely many primes) and 
for which K has a prime g|p of degree 1. The existence of p follows from the 
second hypothesis. Let $ be a prime of L lying over ¢ and let p = 8NQ(G,), a 
prime of Q(¢,) lying over p. Let o = (8, L/) be a Frobenius automorphism 
associated to 8. Since @ has degree 1, we have o(x) = 2? {mod §) for all 
« € O,. We claim that the restriction of o to Q(¢,) is 67, which will prove 
the surjectivity and end the proof of the lemma. But this restriction, call it 
T, is an element of Gal(Q(¢,)/Q) which preserves p and such that r(z) = x? 
(mod p) for all « € Z[¢n], thus it has to be (p,Q(¢,)/Q) = ap (it is easy to see 
that op satisfies the properties which uniquely characterize (¢,Q(¢n)/Q)). It 


remains to observe that op = o;, as p= 1 (mod n). im 


Tt is now easy to finish the proof of the theorem: consider a2 = a} eH. 
By lemma 9.B.59 there is o € Gal(L/K) restricting to a. By proposi- 
tion 9.B.43 we can find infinitely many primes g of degree 1 of K such that 
(8,L/K) = o for some Bip. These primes g correspond therefore to primes 
p such that p = [? (mod n) and for which p|f(x) for some z € Z. The first 
hypothesis yields therefore 7? = 1 (mod n) or [? = I (mod 2). The result 
follows, oO 


Using the tools developed in this addendum, we are also able to prove 
theorem 2.4.6. For the reader’s convenience, let us recall the statement. The 
proof is adapted from the beautiful paper [32], where a much more general 

‘result is proved. 


Prheorem 9.B.60. Let n > 1 and a be integers such that a is an nth power 
modulo p for all sufficiently large prime numbers p. Then either a is the nth 
power of an integer or 8|n and a = 2276” for some integer b. 


Proof. We will use without comment the following well-known, but nontrivial 
result: if an integer a is a perfect square mod p for all sufficiently large primes 
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p, then a is a perfect square. While elementary proofs exist,!® this statement is 
also an immediate consequence of Chebotarev's density theorem 9.B.41 applied 
to the splitting field of X? ~ a. 

So, ifn is even, we already know that a is a perfect square, in particular 
positive (the case a = 0 is trivial and excluded in this proof). Hence, by 
replacing a by —a if n is odd, we may and will assume that a > 0. The crucial 
ingredient of the proof is the following application of Bauer’s theorem 9,B.47: 


Lemma 9.B.61. If z is a complex root of X" — a, then Q(z) C Q(En), where 
Rix 
Gazer. 


Proof. Let K, = Q(¢,) and let K2 be the splitting field of X" — a over Q. 
It is enough to prove that Ko © Ky. Using Bauer’s theorem, it suffices to 
prove that for all sufficiently large primes p € P,(K1), p is completely split 
in Ko. If p does not divide n (which certainly happens if p is large enough), 
the condition p € P; (41) is equivalent to nlp ~ 1 by proposition 9.B.25. Also, 
if p is large enough, then a is an nth power modulo p. These two conditions 
imply that X" ~ @ is a product of n distinct, linear factors in F,[X]. Using the 
Dedekind-Kummer theorem 9.B.22,!” we deduce that p is totally split in Ke, 
which finishes the proof of the lemma. Oo 


Coming back to the proof of the theorem, let d be the least positive divisor 
of n for which Vat € Q. Write Ya = #6 for some c € Q4. By lemma 9.19, 
the polynomial X¢ — ¢ is irreducible over Q and by lemma 9.B.61 we have 
Q(¥e) C QCn). Since Q(G,)/@ is Galois with commutative Galois group, 
any subfield of Q(¢,) is again Galois over Q and so Q( ¥c)/Q is Galois. In 
particular, ¢q- ¢/ce (which is a conjugate of ~/c) is again in Q(¢/c) and so it is 
areal number. We deduce that ¢g € IR, which implies that d= 1 or d= 2, If 


The Jacobi symbol (2) for odd m > 3, defined as the product over the prime factors of 
m with multiplicity of the corresponding Legendre symbols, detects quadratic nonresidues 
mod m and is easily seen to obey quadratic reciprocity. The Chinese Remainder Theorem 
then permits the construction of a “bad” value of m. 

‘Tf be fair, we need the obvious version of this theorem in which Q is replaced by Ky 
and K by Kz = Ki( /a). We leave to the reader to convince himself that the statement and 
proof of this general version are exactly the same. 
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d= 1, then a is the nth power of a rational number and we are done (as a is 
an integer, this rational number is actually an integer). 

Suppose now that d = 2 and write n = 2m so that a is an mth power. 
Note that a is also a perfect square by the discussion in the first paragraph. 
If m is odd, then @ is an mth power and a perfect square, hence it is an 
n= 2mth power and we are done. Suppose that m is even, so 4|n. Then 
a is a 4th power modulo all sufficiently large primes and, by what we have 


we deduce that Q(,/c) Cc RM Q(4) = Q, so ¢ is a square and a is a fourth 
power. If m = 2k with k odd, this implies that @ is a lem(m,4) = nth 
power and we are done again. Finally, assume that n = 2’. u, with i > 3 
and w odd. Since @ is an 2'th power modulo all sufficiently large primes, 
lemma 9.B.61 implies that Q(/c) C Q(¢y:). The extension Q(¢):)/Q is Galois, 
with Galois group (Z/2'Z)*. Its sub-extensions which are quadratic over Q 
are classified by the subgroups with two elements of (Z/2'Z)*, which in turn 
are classified by the nontrivial solutions to the equation 2? = 1 (mod 2+). 
There are three such solutions, giving rise to the quadratic sub-extensions 
Q(V—1), Q(V2), Q( V2). Since \/e is real, we deduce that Q(/¢) C Q(V2) 
and so.c is either a perfect square or twice a perfect square. As @ = c?, this 
finally yields the desired result. Cl 


Chapter 10 


Arithmetic Properties of 
Polynomials 


This chapter deals with elementary, but rather challenging problems con- 
cerning the arithmetic of polynomials with integer coefficients. The techniques 
are very diverse, going from the very basic fact that a-b divides f(a)— f(b) for 
f © Z[X] and a,4 € Z to Mahler expansions, finite differences, p-adic analysis, 
finite fields, ete, 


10.1 The a— d|f(a) — f(b) trick 


One can extract a lot of arithmetic information from the fact that a -- 6 
divides f(a) - f(b) for all polynomials with integer coefficients f, but often 
one needs a lot of inventiveness to do this. The next problems are all quite 
tricky, even though they hide very simple ideas. The first problem is a version 
of a very classical result, stating that there are no nonconstant polynomials 
all of whose values are prime numbers. 


1. Let fi, fo,-... fx be nonconstant polynomials with integer coefficients. 
Prove that for infinitely many 7 all numbers f)(), fo(n),..., fa{n) are 
composite. 
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Proof. Let g = fifa----+ fy and fix a positive integer no. Since none of the 
polynomials g and f, is constant, there are a,b > ng such that 

min(|g(@)], |fi(a)},...,[fe(a)l) > 1 and [fila + |o(a){d}{ > lg(a)| 


for all ¢. Since | f;(a)| divides f,(a + !g(a)|b) and is less than fila+|g(a)|d)|, it 
follows that all numbers f;(a+ |g(a)|b) are composite. The result follows. 


The following two problems deal with periodic points of polynomials with 
integer coefficients. 


2. Let f € ZX) and let n > 3. Prove that one cannot find distinct integers 
©1,€2,-..,€y such that f(a) = 21,1 = 1,2,...,n, indices being taken 
mod n. 


Proof. If such a polynomial f and integers 2; existed, 


®i~ U1 = f (tina) ~ f(a) 
would be divisible by 2:41 — 2. In particular, |2; — ani] > |ei41 — x,| for all 
+, which forces the numbers |z; — 2;—1| to be equal. Since 


n 


SS” (tiv1 — ai) = 0, 


i=] 


there exists i such that 2j4; — a; and x49 — 244 have different signs. But 
then «; = xi;2, which contradicts the hypothesis that 2; are all distinct. The 
result. follows. Qo 


3. Let f € Z[X] be a polynomial of degree n >'2. Prove that the polynomial 
f(f(X)) ~ X has at most n integer zeros. 


Gh. Eckstein, Romanian TST 


Proof. Suppose that x; < x2 < ++. < ayy are distinct integers such that 
f(f(xi)) = 2. Then f(z;) are also pairwise distinct and fei) ~ f(z;) isa 
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multiple of a; —x;. But 2; — aj = f(f(2:)) — f(f(x;)) is also a multiple of 
f (ai) ~ f(;). Thus, we must have |f(«i) — f(«y)| = ja; — x]. But then 


R Rn 
Lf Geist) ~ f(z) = SO bea — wil 
j=l t=1 
= Xnt+1— © 


i tecei-<teal 
= Sof (wen) — f(@))), 
i=l 


which implies that all numbers f(xi41) - f(z) have the same sign. Combined 
with the equality ! f(a) — f(@iqi)| = jai ~ vig1| this shows that either all 
numbers f(x;) + x; are equal or that f(;) — «; are all equal. This is however 
impossible, as f has degree n. o 


Remark 10.1. A very similar problem was given at the IMO 2006: let P(X) be 
a polynomial of degree n > 1 with integer coefficients and let & be a positive 
integer. Consider the polynomial 


Q(X) = P(PC-- P(P(X))))- 
Netto aan” 
k times 
Prove that there are at most n integers such that Q(t) = ¢. It follows from 
problem 2 that any integral solution of the equation Q(t) = ¢ is a solution of 
the equation P(P(t)) = t, so this new problem is actually equivalent to the 
previously discussed one. 
The following two problems are a bit trickier than the previous ones. 


4, Find all integers n > 1 for which there is a polynomial f € Z[X] with 
the property: for any integer k one has f(k) = 0 (mod n) or f(k) = 1 
(mod mn) and both these equations have solutions. 


Proof. The answer is: exactly the powers of a prime number. If 7 is a prime 
power, one can see from Euler’s theorem that the polynomial X° is a solu- 
tion. Conversely, suppose there exists a polynomial f with the properties 
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in the statement and that n has at least two different prime factors p,q. 
Changing f to f(X ~ a) for a suitable a, we may assume that f(0) = 0 
(mod n). Thus (0) = 0 (mod g), which implies that for all b € Z we also 
have f(bq) = 0 a q). Thus, we cannot. have aes = 1 (mod n) and so 
f(bq) = 0 (mod n). In particular, f(bg) = 0 (mod p). But then for all a € Z 
we also have f(ap + bq) = f(bg) =0 (mod p) and so f(ap + bg) = 0 (mod n). 
Since any integer z € Z can be written ap + bg for suitable a, 6, it follows that 
the equation f(x) = 1 (mod m) has no solutions, a contradiction. Oo 


5. Find all polynomials f with integer coefficients such that f(n)|2" ~1 for 
all positive integers n. 


Polish Olympiad 


Proof. Of course, if f is constant, then f has to be either 1 or —1 and these are 
solutions. So, assume that f is nonconstant. We may assume that the leading 
coefficient of f is positive. Take n such that f(n) > 1 and let p be a prime 
factor of f(n). Then p|2” —1 and also p|f(n+p)|2"+” ~ 1, But then p[2? -- 1, 
which is certainly impossible. So any such f is constant and the solutions are 


J; =, a 


G6. Let f € Z{[X] be a nonconstant polynomial. Prove that the sequence 
(3°) (mod n) is not bounded. 


Proof. Changing f with its opposite, we may assume that the leading co- 
efficient of f is positive. So, given N, there exists m such that f(3") > N. 
Choose a prime p > 2f(3") and observe that {(3?”) = (3) (mod p) by Fer- 
mat’s little theorem. In particular, if r = f(3"?) (mod mp), then r = f(3") 
(mod p) and so r > f(3") > N, finishing the proof. oO 


7. Is there a nonconstant pobmouie) - : Z[X] and an integer @ > 1 such 
that the numbers f(a), f(a”), f(a°),... are pairwise relatively prime? 


Saint Petersburg 1998 
Proof. Assume that f is such a polynomial and a is as in the statement. 


Let g = ged(a, f(a*)) = ged(a, f(0)). If g > 1, then g is a common fac- 
tor of all f(@?). Thus g = 1, so a and f(a') are relatively prime for all 
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i. Choose an i with |f(a')| > 1 and choose j = i+ y(|f(a*)|), where 
is Buler’s totient function. Then f(a‘) divides a — a’ by Euler's Theorem, 
so f(a’) divides f(a’) ~ f(a*) and f(a')|f(a7). But this contradicts the fact 
that ged( f(a"), f{a7)) = 1 and shows that the answer to the problem is nega- 
tive. i 


The next problem is a variation on the following very classical (but non- 
trivial} problem: what are the polynomials f ¢ Z[X] such that f(p) is a prime 
for all prime numbers p? 


8. Find all polynomials f with integer coefficients, having the following 
property: there exists & such that for all prirues p, f(p) has at most k 
prime factors. 


Proof. All polynomials of the form 4X™ work, for obvious reasons. Let f be 
a solution of the problem and suppose f is not of this form. So we can write 

£(X) = Xg(X) for some polynomial g with integer coefficients such that 

a # 0 and g is not constant. By Schur’s theorem, there are infinitely many 
tains dividing at least one of the numbers g(1), g(2),.... In particular, we can 
choose pi, p2,...,Pk41 distinct primes greater than |g(0)| and we can choose 
positive integers a1, a2,.-.,@41 such that g(a;) = 0 (mod p;). Using the Chi- 
nese Remainder Theorem, we can find an integer a such that a = a; (mod p;) 
for all i. Note that a is relatively prime to pypz--- py, (otherwise some p; 
would divide @, so pila; and then pilg{Q), a contradiction). Thus, by Dirich- 
let’s theorem, ther e exist infinitely many primes p = a (mod pip +++ pei). In 
particular, we can choose such a prime p with f(p) 40. Since by construction 
f(p) is a multiple of pypo---pyy1, it follows that f is not a solution of the 
problem. The result follows. oc 


Proof. Note that all polynomials of the form f(X) = aX™ work. Suppose 
F(X) = X™g(X) is such @ polynomial which is not of this form. Then g(X) 
is also such a polynomial. Thus we may assume that f is nonconstant and 
J(0) #0. Let p be a prime not dividing f(0), hence p does not divide f(p). 
By Dirichlet’s Theorem there are infinitely many primes g of the form g = 
pt+kf(p)*. Choose such a q with |f(g)} > \f(p)|, For such a q we have 
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f(q) = f(p) (mod f(p)*). Thus f(q) is divisible by every prime factor of f(p), 
with exactly the same multiplicities, and at least one additional prime factor. 
Iterating this construction, we can find a sequence of primes (p,) for which 
£ (pn) has at least n prime factors. Thus the only solutions are the ones already 
given. Oo 


9. Find all polynomials f € Z[X] with the property that for any relatively 
prime integers m,n, the numbers f(r), f(nm) are also relatively prime. 


Tranian TST 


Proof. The solutions are the polynomials +X¢ for d > 0. Trivially, these are 
solutions of the problem. Consider now any nenconstant solution f. Suppose 
that p is a large prime for which p does not divide f(p). Then p and p+ f(p) are 
relatively prime and thus f(p) and f(p+ f(p)) are relatively prime. However, 
f(p) divides f(p-+ f(p)). Thus necessarily |f(p)| = 1 or f(p) = 0. Since p was 
large, none of this happens (f is nonconstant). Thus, for p large enough we 
have p|f(p), forcing pi f(0). This implies that f(0) = 0 and so we can write 
F(X) = Xg(X) for some polynomial g with integral coefficients. Of course, g 
satisfies the same property as f and has smaller degree. By an easy induction 
on the degree of f, we deduce that f is indeed of the form +X@ for some d. 


Proof. We prove first the following easy lemma: 


Lemma 10.2. If f € Z[X] is not of the form +X", then there exist different 
primes p,q such that ql f(p). 


Proof. Assuming the contrary, we may assume that the leading coefficient of 
f is positive. So, for all large enough p, f(p) = p*r for some integer kp. Since 


kp = he fin converges to deg(f), it follows that ky, == deg(f) for all sufficiently 


Coming back to the proof, choose a solution f of the problem and suppose 
that f is not of the form +X”. Pick primes p,¢ as inthe lemma. Then q divides 
f(p) and f(p + q) (since q¢ divides f(p + g) — f(p)). But this is impossible, 
since p, p+ are relatively prime and so f(p) and f(p+q) are relatively prime. 
The result follows. Oo 
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It is a classical fact that for any integer a and any positive integers m and 
nm we have 
ged(a™ — 1, a — 1) = a8") _ 1, 


A sequence (dn)n with the property that ged(@m.@n) = Ggeatmn) is called 
a Mersenne sequence. Thus (a” — 1), is a Mersenne sequence. In chapter 
3, problem 3 we proved that for any Mersenne sequence a, there exists a 
sequence of integers 6, such that a, = Than bg. The next problem shows 
how to construct a Mersenne sequence by iterating a polynomial with integer 


coefficients. 
y 10. Let f be a polynomial with integer coefficients and let a9 = 0 and 


an = f(Qn—1) 
for all n > 1. Prove that (a, ),>0 is a Mersenne sequence. 


Romanian TST 


Proof. Let g = f%, the composite of f with itself d times. If m = du and 
n= du, with ged(u,v) = 1, are any positive integers, note that an = g”(0), 
am, = g"(0) and ag = g{0). So, it is enough to prove that for any g with 
integer coefficients and any ged(u,v) = 1 we have ged(g*(0), g"(0)) = g(0). 
Let us remark that for any polynomial h with integer coefficients and any k 
we have h(0)|h*(0). This is obvious. Applying this to g already shows that 
g(0) divides ged(g*(0), g’(0)). Conversely, let 2 = ged(g"(0), g”(0)). Applying 
the previous remark to h = g” and h = g”, we obtain that for all A, B > 1 we 
have z|g4"(0) and z|g®"(0). Taking A,B such that Bu = Au +1, we obtain 
that z|g(g4"(0)) and zig4"(0). It is then clear that x divides g(0) and the 
conclusion follows. 


11. Find all positive integers k such that if a polynomial with integer coef- 
ficients f satisfies 


OS £0), FQ) fH+N Sk, 


then f(0) = f(1) =-+ = f(k) = f(k +1). 
IMO 1997 Shortlist 
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Proof. if k < 2 we can easily find counterexamples, for instance f(X) = 
X(2—X) fork = 1 and f(X) = X(3— X) for k = 2. So, assume that k > 3 
and let f be a polynomial with integral coefficients such that 


O< f(0), fQ),..., f(k+) <k. 


As f(k+ 1) ~ f(0) is between —k and k and since it is a multiple of k + 1, we 
must have f(0) = f(k +1). Thus, we can write 


£(X) ~ FQ) = X(X =~ (k + 1))g(X) 


for some polynomial g, which clearly has integer coefficients (note that X 
divides f(X) ~ f(0) and X is relatively prime to X ~ (k +1). 

Thus, f(0) + é(i — k ~ 1)g(é) is between 0 and k for all O <i < k4+1. 
In particular, we have k > i(k + 1 ~ 4)|g(#)| for 4 in this range. However, 
i(k+1—i) > k for 2 <i< k-~1, so that g(i) = 0 for2<i<k-1l. In 
particular, we can write 


Q(X) = (X ~2)-- (KX ~k+ 1)A(X) 
for some polynomial h, again with integer coefficients. But then 


F(k) ~ F(0) = —k(k ~ 2)th(k),  f() ~ (0) = (— 1) Ek ~ 2)1h(A). 


This implies that k(k — 2)!h(x)j < k for x € {1,k}, which clearly implies that 
h(1) = h{k) = 0, unless k = 3. Therefore, unless k = 3 we can definitely 
conclude that f(0) = f(1) = --- = f(k+ 1) and so all k > 4 are solutions 
of the problem. It remains to study the case k = 3. But we note that we 
can actually have equality in all previous inequalities, yielding without much 
difficulty the polynomial f(X) = X(X — 2)?(4—X). This proves that & == 3 
is not a solution and the problem is solved. 


Fermat’s little theorem implies that p divides 2? — 2 for any prime p. The 
following question is then rather natural to ask, but not easy to answer. 


12. Find all polynomials f with integer coefficients such that f(p)|2? ~ 2 for 
any odd prime number p. 


Gabriel Dospinescu, Peter Scholze 
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Proof. We will use the fact that f(p) divides f(p+kf(p)) for any k and choose 
k such that p + kf(p) is again a prime. Of course, this is not possible unless 
p is relatively prime to f(p). 

Since f(X) = X is a solution (by Fermat’s little theorem), a reasonable 
first step will be to prove that any solution satisfies f(0) = 0. This is the 
hardest step. Assume for a moment that we proved that any nonconstant 


satisfies the same conditions. So either g is constant or it is a multiple of X. 
Continuing like this we obtain that f(X) = +2°X° for some a, > 0. Clearly 
a <1 and by taking p = 3 we obtain that b < 1. Thus, all solutions are of the 
form f(X) = £2¢X° with a,b < 1 (these are clearly solutions of the problem). 

Now, let us prove that if f is nonconstant, then f(0) = 0. Assume that 
(0) # 0 and, by changing f with —f, that the leading coefficient of f is 
positive. Then for large primes p, p is relatively prime to f(p) (as if p divides 
f(p), then p divides f(0)). By Dirichlet’s theorem for such a prime p we 
can choose infinitely many k > 1 (depending on p, of course) such that ¢ = 
p+kf(p) is again a prime. Then f(p) divides 2? —2 and also 2°t*f) 9. But 
then f(p) also divides 2(28¢4(—1e-1+#/(?}) 1). As f(p) = f(1) (mod p~ 1), 


. we deduce that f(p) divides 2(28°4?-14F@) — 1), Now, if f(1) = 0, all this 


work will accomplish nothing, but the good news is that: f(1) cannot be 0: 
otherwise, f(y) would be a multiple of p — 1 and so p~ 1 would divide 2° ~ 2 


there is a big obstacle in the previous approach: k depends on p and we have 
absolutely no control on it. The crucial remark is that we can however control 
ged(p ~1,kf(1)) by choosing more carefully k. This is the object of the next 
paragraph. 

We will choose k of the form k = 2+ r(p — 1) for suitable integers r. In 
order to do that, we need to ensure that p+ f(p)(2+1r(p—1)) is a prime. This 
is realized for infinitely many r if p + 2f(p) is relatively prime to (p ~ 1) f(p). 
Now, if] is a prime factor of p+2f(p) that divides (p—1)f(p), then we cannot 
have 1] f(p) (as otherwise ! divides p and so | = p divides f(p), a contradiction), 
so I divides p~ 1 and also 1 + 2f(1) (since f(p) = f(1) (mod p—1)). But we 
can now do the following: choose from the beginning large primes p such that 
p~1is relatively prime to 1+-2f(1), for instance primes p = 2 (mod 1+2f(1)). 
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There are infinitely many such primes by Dirichlet’s theorem. For such P, we 
Saw that p+ 2/1 {p) is relatively prime to (p ~1)f(p), so we can indeed choose 
& of the form 2+ r(p~ 1) such that Y= p+kf(p) is a prime. The previous 
Paragraph shows that f(p) divides 2(28¢4p-LR F(A) 1). But ged(k, p— D9, 
80 that f(p) actually divides 22/00 _ 1. But since f(p) can take arbitrarily 
large values and since f(1) 4 0, this is a (so desired) contradiction. This shows 
that f(0) = 0 and then the problem js solved by the first paragraph. ia 


10.2. Derivatives and p-adic Taylor expansions 


Most of the time when working with congruences mod p it is enough to 
know that f(@) = f(6) (mod p) whenever a = b (mod p). However, when 
dealing with congruences with prime power moduli, one is sometimes obliged 
to use more precige estimates. Assume that fe ZLX] has degree n and 


which is valid for any complex numbers z,h. Note that L@ is an integer. 
Indeed, by linearity it suffices to Prove this when f(X ) =X” and in this case 


fF (c) R\ 
Ef (es 


It follows easily from this that we have an equality in ZX] (thus in R{X] for 
any commutative ring R) 


2 tk) 
FX) = SEM or _ yyy, 
k= 


In particular, we have the very useful congruence f(a +b) = fi (a) + bf"(a) 
(mod 5?) whenever f € ZX] and a,b€ Z. We give a few applications of these 
ideas in this section. 


is 495 
10.2. Derivatives and p-adic Taylor expansions 


et p be @ prime and let Z[X} be a polynomial. I A ab eee 
e a » Hs f (0) i ( , 
3. Let p be ime and let f € Z[ ry: . 
f P 1) give distinct. remainders when divided by p, prove that the 
Ate Ss f(Q) f (1) f (ip E 9) give distinct remainders when divided 
» yeti 


eet 
Putnam 2008 


Proof. Assume that f(i) = f(j) (mod p*) for some 4,7. a yhe 5 7 o) 
(mod p*) and since f is injective mod p*, we deduce that i = j wie Bes i 
j=itp’k. It is enough to prove that k = 0 (mod p). Assume th 


not the case. We have 
2 pty: oS 
f(t) = FG) = fi + kp?) = f(a) + kp’ sf") med p), 
so p divides kf’(i), hence p divides f(z). But then 
fit kp) = f(t) + kpf'() = f(@) (mod p*), 


ields i sd Ye adic- 
which, combined with the hypothesis, yields i +kp st ih : da contrac a 
tion. ‘Thus k = 0 (mod p) and i = 7 (mod p*). The result follows. [ 


14. Let P be a polynomial with integer coefficients such that P(0) = 0 and 
gced(P(0), PQ), P(2),...) = 1. 
Show that there are infinitely many n such that 
ged( P(n) — P(0), P(n +1) — PQ), P(n + 2) ~ P(2),...) =n. 
USA TST 2010 


Proof. Let us try to study first 
dy, = ged(P(n) — P(0), P(n +1) ~ P(1),...) 
a : yok ke stor of de, 
for any polynomial P with integer coefficients. Let. g be a prime factor of dp 


so that P(n-+-k) = P(k) (mod q) for all k, i.e. P is n-periodic Pee = 
P is also g-periodic modulo g. Thus, if ged(g,n) = 1, then P is 1-periodic 
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modulo g (by Bézout’s lemma) and so q divides P(n +1) ~ P(n) for all n. 
Then g divides P(n) — P(0) for all n, so if P(0) = 0, then q must divide 
ged(P(0), P(1),...). In particular, for our polynomial we must have qin for 
any prime factor g of dy. 

The previous paragraph suggests taking for n a power of a prime, say 
n=p%. Then we saw that d,, is also a power of p. Note that dp is a multiple 
of n, since n divides P(n +k) ~ P(k) for all k. It remains to see if we ean have 
PN UP(k +p) — P(k) for all k. Since 


P(k+p™) = P(k) +p" Pk) (mod p*+4), 


this would imply that p divides P’(k) for all k. Now we see how to choose our 
numbers n: pick and fix once and for all a value & such that Pk) #0. For 
all sufficiently large p, p does not divide P’ (k). For any such p, the previous 
arguments show that d, =n for all n =p’. The conchision follows. o 


Before tackling the next problem, we need to recall a very standard result, 
known as Lagrange’s theorem, Consider f € ZX] and a prime p. Let f € 
F,[X] be the reduction of f mod p and assume that F #0. As F, is a field, 
it follows that f has at most deg f < deg f distinct roots in F,. Therefore, 
if f # 0 (which is equivalent to the fact that at least one coefficient of f is 
not a multiple of p), then the congruence f(x) = 0 (mod p) has at most deg f 
pairwise distinct solutions. The next result is a generalization of this classical 
theorem; it will also be used in the solution of problem 20. This was one of 
the problems proposed in the Iranian TST 2011, but the result is much older, 
see for instance theorem 27, chapter II of the beautiful book [2i}. 


15. Let p be a prime, k a positive integer and f € ZX] such that p* di- 
vides f(x) for al z € Z. fk < p, prove that there are polynomials 
Gs 900-59 € ZX) such. that 


k 
£(X) = Sk ixr — AF gi). 
iO 


Proof. The proof is by induction on k. fk = 1, perform the division algorithm 
in Z[X] for the polynomials f and XP — X (which we can do, as X? ~ X is 
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monic) to find q,r € Z[X] such that f(X) = (X? ~ X)q(X) + r(X) and 
degr <p. Then p divides r(a) for all integers « (by Fermat’s little theorem 
and the hypothesis) and the result follows from Lagrange’s theorem. Assume 
that the result holds for k and that k+1 <p. If p**! divides f(«) for all 2, then 
by the inductive hypothesis we can write f(X) = Ses p(X? — X)* gi X) 
for some gj € Z[X]. Pick any integers x and z and write x? —« = py for some 
integer y. Then (# + pz)? — (a + pz) = p(y ~ z) (mod p?), thus 


k k 
f(at+p2) = So pk (y ~ z)'oi(x + pz) =p" Sy ~ 2)'gi(@) (mod p*). 


i=0 ist 


Therefore the hypothesis on f implies that p divides Dan zgi(x) for all in- 
tegers z. Using the fact that k +1 < p and Lagrange’s theorem, it follows 
that p divides g;(x) for all i and all x. By the case k = 1 we can write 
G(X) = (XP — X)Ai(X) + pri(X) for some hy, r; € Z[X]. Replacing these ex- 
pressions in f(X) = S7*9 p*-#(X? — X)'- gi(X) yields the desired result. C1] 


10.3 Hilbert polynomials and Mahler expansions 


Consider the polynomial 


(*) _ X(X-1(X-2)--( 


n nt 


known as the nth Hilbert polynomial. It is clear that this polynomial has 
rational (not necessarily integer) coefficients and it is not difficult to check that 
it takes integer values at all integers (note that «(#—1).--(r-n+1) =nl(®) 
if z > 1, while 


od) 


n 


a(@—i)-(@=ntlH= -( 


if « <0). These polynomials play a fundamental role in the theory of integer- 
valued polynomials because of the following beautiful and classical result of 
Polya: 
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Theorem 10.3. Let f be a polynomial of degree n, with real coefficients. Then 
F(Z) CZ éf and only if there exist integers ag,a1,02,...,4n such that 


H(X)= 0 +01(7) +a(3) tetan(*), 


Proof. One implication follows from the previous discussion, so assume that. 
f(Z) CZ. The polynomials (*), (¥),...,(%) have degrees 0,1,...,n, thus 
they form a basis of the R-vector space of real polynomials with degree at 


most n. Thus there exist unique real numbers ag,a1,....@,, such that 
n 
xX 
A(X) = Yaw: (;): 


Consider the operator Af(X) = f(X + 1) — f{X) and observe that 


a(n) (ena) 


Applying A successively to the relation 
4 n 
x 
Hx)= Doe (FZ): 
k=0 


we deduce that a; =: A’ f(Q) for all 7. On the other hand, an immediate 
induction shows that 


k 
k See fF : 
ayn) = Don (Px +9. 


Thus, if f(0), f(1),..., f() are integers, then so are ay,a1,..., Gp. The result 


follows. ea] 


The proof of the previous theorem shows that 


k 
k 
= eoe(*) 40). 
i> () 
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We call a; the Mahler coefficients of f and we refer the reader to the addendum 
3.B for more details on these coefficients, which play an absolutely fundamental 
role in p-adic analysis. Another consequence of the proof of theorem 10.3 is 
the following very useful result. 


Proposition 10.4. If f is « polynomial of degree n with coefficients in an 
arbitrary commutative ring, then 


ntl 


Sep (" : |] f(X+k)=0. 


k=O 


Proof. The left-hand side is A"*+' f(X), However, we have deg Ag < deg g for 
any nonzero polynomial g. We deduce that A?*! f is the zero polynomial and 
the result follows. Oo 


The next problem is an immediate application of theorem 10.3. The reader 
might try te solve it in a different way and he will realize that the problem is 
actually quite tricky. 


YI6. Let x be a positive integer. What is the least degree of a monic polyno- 
mial f with integer coefficients such that n|f(&) fer any integer k? 
Proof. Let d = deg f. By theorem 10.3 we can write £00) Shy ae ({) for 
some integers a;. Considering the leading coefficients in this equality shows 
that d!is a multiple of n. On the other hand, if d! is a multiple of n, then 
we can take f(X) = X(X + 1)---(X +d—1). Thus the answer is: d is the 
smallest integer such that d! is a multiple of n. O 


¥ 17. Let f be a polynomial such that f(n) € Z for all n € Z. Prove that 
for any integers m, the number lem/1,2,...,deg(f)] - Len) (0) is an 
integer. 


MOSP 2001 


Proof. Fix m,n distinct integers and let d = m—n and g(X) = f(n+X). Then 
g has rational coefficients, sends integers to integers and deg(f) = deg(g). So, 
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we need to prove that 
~ 90 
lem{1, 2,...,deg{g)} - 94) — 90) 
is an integer. Let D = degg, so, using theorem 10.3, there exist integers 


0, 1,-.-,@p such that. 
D x 
aX) = a(7). 
i=O 


It is thus enough to prove that for any 1 <i< D we have 


1 
lem[1,2,...,D]- (1) € Z. 


But the left-hand side is equal to temft 2D] oy which is clearly an integer. 
The result follows. 


18. Let f be a polynomial of degree d such that f(Z) C Z and for which 
f(m) - ; (x) is a multiple of m —n for all 0 < m,n < d. Prove that 
f(m) ~ f(n) is a multiple of m ~ n for all integers m,n with m £ n. 


Holden Lee 


Proof. Since f(Z) C Z, theorem 10.3 yields the existence of integers ag, a1,.. 
ag such that 


‘s 


d 
xX 
#(%)= Soa (Z). 
k=0 : 
The result follows now from the previous problem and the following general 
result. 


Lemma 10.5. Let a; € Z and let 


Sto Xe 
= Saul ) and L, = lom(1,2,...,k) 
k=0 k 


(by convention Ly = 1). Then the following assertions are equivalent: 
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a) m—n divides f(m) ~ f(n) for lO <mAén<d= deg f. 
b) Ly divides ax for lO <k <d. 
c) m~n divides f(m)— f(n) for allmAn€é ZB. 


Proof. Suppose that a) holds. We will prove by induction on i that ZL; di- 
vides a;, This is clear for i = 0, so assume that ao,...,@j.1 are multiples of 
Lo, La,..., Li.) and fix 0 < 7 <i. Then j ~ é divides 


19-10)-¥a4(i)-G))- Em (()-()- 


By the previous problem and the inductive hypothesis, each of the numbers 
ak (@) a @)) with @ < & < iis a multiple of i~ 7. We deduce that i -j 
divides a; and since 7 < i was arbitrary, it follows that L, divides a;. Hence 
a) implies b). The previous problem shows that b) implies c) and since it is 
trivial that c) implies a), the result follows. We are done. 


Remark 10.6. The fact that the polynomial f(X) = ah a, - (*) satisfies 


£ 2 € & for alla #b€e Z if and only if lem(1,2,..., &)Jay for all k seems 
to have been first noticed by R.R.Hall and 1.2. Russa: 


The following problem strongly suggests using Lagrange’s interpolation 
theorem, but there are some difficulties in making the argument work, since the 
polynomials appearing in Lagrange’s formula don’t have integer coefficients. 
The problem is quite tricky and makes again use of Hilbert’s polynomials. 


19. a) Prove that for all positive integers n there is a polynomial f € Z[X] 
such that all wumbers f(1) < f(2) <--- < f() are powers of 2. 
b) Let a > 1 be an integer and let n be a positive integer. Prove that 
there exists a polynomial f of degree n, having integer coefficients, 
such that f(0), f(1),..., f(n) are pairwise distinct positive integers, 
all of the form 2a* + 3 for some integer k. 


Chinese TST 2004 
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Proof. a) We will choose f of the form 


nx) = 4-5) (*) a 
i=D 


for some suitable integers A,B, to be chosen later. By the binomial fornula, 
for any 0 < i <n we have f(i} = A(1+B)!. Now, of course we want f to have 
integral coefficients and unfortunately os ) does not have integral coefficients. 
However, n! (*) has integral coefficients for all 1 <isn. Note however that 
we cannot simply take for B a multiple of n!, since then. A(i+3B)' has no chance 
of being a power of 2. However, we can profit from the presence of A: take A 
to be 2%" and B an odd multiple of the greatest odd divisor of n!. Then the 
previous remarks show that f has integral coefficients, Finally, we want 14+ B 
to be a power of 2. Thus, we can choose (for example) 1+ B =: 2° where d 
is the largest odd divisor of n!. With these choices, f(1) < f(2) <-++. < f(n) 
are powers of 2, as needed. 

b) This uses the same idea as a). Write n! = m-q, where all prime factors 
of m are among those of a and where ged(q,a) = 1. Let 6 = a? — 1, so g 
divides 6. Finally, define 


f(X) = 20" 55 (“\o+s 


imO 


It has integer coefficients because i[ntia™ - & for all 0 <i <n. Moreover, for 
1<k <n we have 


P(i) = 2a (b+ 1) +3 = 2a™tOMF 4 3. 


Remark 10.7. It. is also true that for all positive integers n there is a polynomial 
fe 2X ] such that all numbers f (1) < f(2) <--- < f(n) are prime numbers. 
We will try to find such a polynomial of the form 3 


£(X) = ay (X ~ 2)(X —3)---(X ~n) + aQ(X ~1)(X —3)--(X —n) 
beh ay(X =D (Xm ntl, 
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for some suitable integers a;. The reason is that f(i) only depends on a;, so 
that it will be rather easy to adjust a; in order to make f(i) prime. Indeed, 
note that 
fli) = (-1)" "(m= DIG - lay + 1. 

Now, we will choose the a;’s inductively so that f(1) <-+--< f(n) are primes. 
We will heavily use theorem 9.6, according to which for all n there are infinitely 
many primes of the form 1+ kn. Thus, there exists an integer a; such that 
1+ (—1)""'(n — Dla; = f(1) is a prime. Fix such a, and choose (again by 
the cited result} an integer az such that 1+ a(n — 2)-1)""? = f(2) is a 
prime greater than f(1). Continuing like this, we find a1, a9,...,@» such that 
f(1) < f(QQ) <---< f(n) are primes and the problem is solved. 


20. Suppose that n is a positive integer not divisible by the cube of a prime 
number. Consider all sequences (1, £2,...,%n) with 2; € Z/nZ. For 
how many of these can we find a polynomial f with integer coefficients 
such that f(¢) (mod mv) = a; for all 7? > 

USA TST 2008 


Proof. Let A, be the additive group of those sequences (x1, @2,..-.0n) € 
(Z/nZ)" associated to integer polynomials, as in the problem. We will show 


that the map 


where i 
nls 
f(X) =a0+ Da [](X - 9, 
i=l j=l 


is an isomorphism of abelian groups ® : Th LZ} = Ay. First, 


note that & is well-defined: indeed, since a product of d consecutive integers 
is a multiple of d!, it is clear that the sequence (f(i))i<i<n does not depend 
on the choice of representatives a; for @. 

Let us prove that & is surjective. Repeated division algorithm shows that 
any polynomial with integer coefficients of degree at most d can be written in 
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the form 4 ; 
f(X) = a9 + Sas T](X - 8) 
i=l jel 


for some integers a;. We may restrict to d <n since all a, with k > n do not 
matter when considering f(i) (mod n). This yields the surjectivity. 

It is clear that © is a group homomorphism. It remains to prove that © is 
injective, so suppose f satisfies ff i) = 0 (mod n) for 1 <i <n. We want to 
show that a; is a multiple of REED gy for 0 <k <n. Assuming the contrary, 
there is some least & for which this does not hold. Then we may assume a; = 0 
for j < k (since replacing them by 0 does not change the values of f mod n). 
But then plugging in X = k +1 gives f(k+1) = kla, = 0 (mod n). and a, 
is a multiple of Hea’ contrary to our assumption. 

Thus the number of polynomial sequences (x,,..., 2) is 


nol 


Il n 
per ged(n, k!) 


Ifa prime p has up(n) = 1, then there are p factors in this product (those with 
k= 0,1,...,p—1) which are multiples of p. If a prime p has Up(n) = 2, then 
there are p factors which are multiples of p® (k = 0,...,p—1) and p more 
factors which are multiples of just p (k = p,...,2p—1). Therefore for n which 
are not divisible by the cube of a prime, we have 


v= Tv I» 
Piup(ny=i prtp(n)=2 
Proof. For a polynomial f € Z[X] and n > 1, let 
Sa(f) = (F(1) (mod n), f(2) (mod n),..., f(n) (mod n)) € (Z/nZ)" 


and let A, = {S,(f}|f € Z[X]}. We want to find |A,|, at least when n is 
cube-free. First, we will prove,the following 


Lemma 10.8. {Ay} is multiplicative, i.e. 


[Arial = |Aml = TAn| 


for all gcd(m,n) = 1. 
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Proof. We will exhibit a bijection between Ay,» and A,, x A,- Consider the 
map 
gi Amn + Am x An, 9(Sinn(f)) = (Sn(f), Sa(f))- 


Note that it is well-defined, for if Sinn(f) = Smn(g), we have f(x) = g(x) 
(mod mn) for all 1 < x < mn and so Sm{f) = Sm(g) and S,({f) = Sp(g). We 
claim that ¢ is bijective. Injectivity is very easy, for if y(Sinn(f}) = o(Smalg)), 
then m divides (f — g)(4) for all 1 < i < m and n divides (f ~ g)(¢) for all 
1<i<n. By the division algorithm, it follows that m divides (f — g}(x) for 
all integers x and the same for n. Since ged(m,n) = 1, we have mn f(x) ~ g(x) 
for all z and we are done. For surjectivity, we need to prove that if f,g € Z[X] 
are arbitrary, then there exists h € Z[X] such that A(i) = f{i) (mod n) for 
all 1 <i <n and A{i) = g(i) (mod m) for all 1 < i < m. Simply choose 
h = Amf + Bng, where A,B are integers such that Am = 1 (mod n) and 
Bn =1 (mod m). The lemma is proved. Oo 


Thanks to this lemma and the hypothesis that n is cube free, it remains 
to find |Ap| and |A,2|. The first task is trivial: for any p-tuple (a1,...,a@p) of 
remainders mod p there exists a polynomial f such that f(z) = a; (mod p) for 
all 1 <i <p, by Lagrange’s interpolation formula. Thus |Ap| = p”. Finding 
|A,2| is more delicate, since Z/p’ °Z, is not a field and so Lagrange’s interpolation 
formula is useless. Moreover, there are nontrivial relationships between the 
numbers f(i) (mod p”), which implies that |A,2| is definitely smaller than 
pr. 

The first point is to note that 


Ap = {S2(fLf € Z[X], deat f) < 2p}, 


as for any f € Z[X] one can find g € Z[X] of degree smaller than 2p and such 
that f(x) = g(x) (mod p?’) for all x. Indeed, simply take for g the remainder 
of f when divided by (X? ~ X)*. Let G = {f € Z/P2|X]\dee() < 2p}, an 
abelian group of order ( Rees = p’® and let 8: G + (Z, /p?2)”” be defined 
by S(f} = 7 (), F(2),- p”)). The previous observation shows that A,2 is 
the image of the map S$ : > (Z/p?Z)”. As [Im(S)] = ty. 
to find [Ker(S)}|. We can Sate describe Ker(S) rather explicitly, thanks 


it remains 
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to problem 15. Indeed, this problem (for k = 2) implies that Ker(S) is in 
bijection with the set of polynomials u € F,[X] of degree smaller than p. As 
this set has p? elements, it follows that |A,2| = p*?. The problem is finally 
solved, o 


10.4 p-adic estimates 


In this section we consider a few problems dealing with p-adic properties 
of polynomials. 


W 21. Let (¢n)nz1 be an increasing sequence of positive integers such that for 
some polynomial f € Z[X] we have a, < f(n) for all n. Suppose also 
that m~ nlam ~ dp for all distinct positive integers m,n. Prove that 
there exists a polynomial g € Q[X] such that a, = g(n) for all n. 


USAMO 1995 


Proof. Let d be the degree of f and choose a polynomial P of degree at most 
d with rational coefficients and such that P(i) = a; for 1 <i< d+. This 
is possible by Lagrange’s interpolation formula. Choose (and fix) N > 1 such 
that h = NP has integral coefficients. Then h(i) = Na; and h has degree d. 
Fix any integer n > d+ 1 and observe that m — 2 divides Nam, ~ Ni dy, and 
m~n divides h(m)—A(n). Thus, ifm < d+1, then m—n divides Na, — A{n). 


Lemma 10.9. For any positive integers 21,02, ....2n, lem(ay,42,--.,2n) is 


a multiple of These ae eta) 


Proof. It is enough to prove that for any prime p, the p-adic valuation of 


lom(x1,22,...,2) is at least that of Tose altace: Writing y% = vp(x;), 
Sie jen SOUL: 2;, 


this comes down to the inequality 


max(y:) 2 Soy ~ So min(y,y). 


i<j 
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O 


which is clear (simply order the y;'s). 
Coming back to the problem, we infer that 


(n-d— aja 
i ~I,n-2,....2n—(d+1)) > a ~, 
lata pean Thcicjcass S0U(m ~ i, — 9) 


which is greater than C\n%*} for some constant Cy > 0 and all large n (this 
is because ged(n —i,n ~ 7) divides j — 7). Thus, if nis large enough, then 
necessarily 


lem(n — 1, ~ 2,...,n~ (d+1)) > |Nan ~ A(n)|- 


Combining this with the result of the previous paragraph, we infer that there 
is ny such that for all n > mp we have Na, = h(n). 

Finally, pick any m > 1 and observe that for all n > mo we have m — 
n|Nam — Nay and m~ nlh(m) — h(n). Since Nay, = h(n), we deduce that 
m~ n divides Nag, — h(m) for all‘n > no, forcing Nam = h(m). Thus, we 


proved the existence of a polynomial g = Hh, with rational coefficients, such 
that a, = g(r) for all n. o 


Remark 10.10. There are a lot of non-polynomial maps f such that m— 
divides f(m) ~ f(n) for alim # n. For instance, pick any sequence of integers 
Gm, infinitely many of which are nonzero and define 


f(r) = Soa (n+ Ant j- Ue (n—3). 


20 


it is an easy exercise to check that f satisfies the desired congruences and that 
f is not polynomial. 

Remark 10.11, It is not true that if @, = g(n) for all n, then g € Z[X]. For 
instance, one can easily check that if g(n) = at z = then z a)=a(n) is an integer 
for all different integers m and n. 


22, Consider all sequences (f(1) (mod n), f(3) (mod n),..., (1023) 
(aod n))}, where n = 1024 and f is an arbitrary polynomial with integer 
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coefficients. Prove that at most 2° of these sequences are permutations 
of 1,3,5,..., 1023 (mod n). 


USA TST 2007 


Proof. Define Py(X) = 1 and P;(X) = Th. (X~(2k—1)) fori > 1. Repeated 
euclidean division (taking into account that P; is monic) shows that each 
polynomial f ¢ Z[X] can be written f = a Py +¢,P, +--+ +o,P, for integers 
j, where n = deg f. 

Note that P; ee -Disa ee af 2' il, so by oe : formula we 


=8, aac 
“hes a Saal f is 5 good if if its aaateiated sequence (as in the statement. 
of the problem) is a permutation of 1,3,5,..,,1023. Let 


HX)= ST aP(X) 


OSisn 


be a good polynomial. If we delete the terms with P; for i > 6 (where a; > 10), 
we get a polynomial with the same associated sequence as f, so we are only 
interested in co, ¢,...,¢5. Note that eg is odd and that f(1) # (3) (mod 4), 
as f(1) = f(4k + 1) (mod 4) and f(3) = f(4k + 3) (mod 4) for all k. But 
since 

£(3) ~ FQ) = x (PA(3) — PiG)) = 2c, (mod 4), 


it follows that c; is odd. Finally, note that if we mod out « by 2!°~@, we get a 
polynomial with the same associated sequence as f, so we are only interested 
in the values of ¢; (mod 2!°-%), But since eg is odd, there are at most 2° 
choices for it; for the same reason there are at most 28 choices for ¢, (mod 2°) 
and for 2 <i <5 there are at most 2!0-% choices for ¢; (mod 21° a Hence 
the number of complete remainder sequences is at most 


5 
2.28. | | atenae we 29.98.97. 98. 93 92 938, o 
i=2 


It is a classical result of Mahler that continuous functions on the ring of 
p-adic integers Zp can be uniformly approximated by polynomials (this is the 
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p-adic analogue of a classical theorem of Weierstrass, stating that continuous 
functions on a compact interval can be uniformly approximated by polyno- 
mials), It is not difficult to see that the characteristic function of 2Z2 + 1 is 
continuous, se it can be uniformly approximated with polynomials. The next 
problem gives a more precise estimate. 


$23. Prove that for all n there exists a polynomial f with integer coefficients 
and degree not exceeding nm such that 2" divides f(c) for all even integers 
x and 2” divides f(x) ~ 1 for all odd integers 2. 


P. Hajnal, KGMaL 


Proof. Define O(a) = 1 if x is odd and 0 otherwise. Then we compute that 


(1—(-241)") = s (or 


Since the binomial coefficients are all integers it follows that truncating after 
the nth term gives a polynomial 


wm=r(F )-aF ean 


of degree n such that for all ie aie m, g(m) is an integer with g(m) = O(m) 
(mod 2). Recall that vo(k!) = k — so(k) < k- 1. Therefore every coefficient 
of g has an odd denominator. ce we can choose a polynomial f(X) € 
ZX] of degree at. most n such that every coefficient of f is congruent to the 
corresponding coefficient of g mod 2". Then f(m) = O(m) (mod 2") for all 
m and we are done. a 


O(m) = 5 (1—(-1)") = 


BOE bet 


7 
2 


Proof. Define O(x) = 1 if is odd and 0 otherwise. We want to find a 
polynomial f of degree at most n and such that f(«) = O(2) (mod 2”) for all 
integers «. We will prove the existence by induction, but in order to prove the 
inductive step, we will need the following key result: 
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Lemma 10.12. Let f be a polynomial of degree at most n and let 


F(z) = f(x) ~ O(2), 
Then for all integers « we have 


FPar+ne)s S-yr-* (" vy ') Fe +k) (mod 2”), 
k=O) " 


Proof. Since Ff has degree at most ", proposition 10.4 shows that 
ntl ‘ 
n+l 
ae : )tte +k) =0, 
k=O) : 


Thus, to prove the lemma is remains 0 prove that 2” divides 


nti 


(PFT) ove + by, 
> Cr . 


However, if x is even this is equal to ~2”, becanse 


n+l = 
O(a) <2 


Similarly, if « is odd, then it ig equal to 2", because 


x ( oe ‘) =n. 


k 
This ends the Proof of the lemma. g 
The crucial point that we deduce from th 
f(z) = O(x) (nod 2") for all 0 < x <n, then 


integers «. This follows trivially by induction, 


is lemma is the following: ig 
f() = O(z) (mod 2") for all 


using the congruence in the 


We can now finall 


'Y Prove by induction the exi 
everything is clear, so 


istence of f, For n, = 
assume that we constructed 


=], 
@ polynomial f of degree 
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tm — 1 such that f(x) = O(a) (mod 2"~') for all integers x. We will 
at most ,— : 
tacts our polynomial of the form 


HX) =f(X)+ Sa TP (x- 9. 


k=O §#KOS3Sn 
It clearly has degree at most: n and for all 0 < k < n we have 
g(k) = £(k) + (~1)" Fay - Ri(n — hi). 


By hypothesis, there are integers by, such tha’ F(R) = Ok) + bp - 2 for al 
asis, ; % b, that eT 
0<k<n. Now, since 


va(ki(n ~ k)!) < w(n!) <n-1, 
g re ivi Os 
it is clear that we can choose integers a, such that 2” divides 
i 


21h, + (-1)" ag - k(n — KDE. 


yy the previous resuits. ne inductive step is thus prove tl ler 
by the previot esults. The inductive ste hus proved and the problem 
&, DS 

is solved. Oo 


i avi smentar’ 

We end this section with a challenging problem ee a all seepaery 

4 in ideas are the pigeonhole principle ai dic 

icate solution. The main i P am baie 

Ee a but the way in which they are combined is rather subtle. ae 

eae ah problem (whose proof is very similar) is discussed in chapter 3, 
case of this - 

problem 13. 


iz > integer. Consider the 
= ZX} a polynomial and let @ be an in ; ee 
dana sot 2 meee (an). If az -+ 00 and the set, of prime divisors 
sequence & = @,Gn41 = A , ‘ 
arin is finite, prove that f(X) = AX# for some A,d. 


‘Tuymaada Olympiad, 2003 


: sth ite imes ssume that 
D, Let f¢ be the composition of f with itself d times. an a 
eh pw comprise all prime factors of all numbers ag, a},.... First, we 
PisP2s-+sPN 
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prove that one of the numbers (0), f#(0),..., £(0) is 0. To do this, fix any 
positive integer r and ng such that for all n > no we have a, > (pipa---pn)’. 
Such ng exists by assumption, Thus, for all n > ng there exists i such that 
Up, (Gn) > 7. Considering N + 1 consecutive values n = np,no+1,...,no+N, 
the 7’s that we find take only N possible values, so two of them will be equal. 
Thus, we can find u € {1,2,...,N}, 42 no and k € {1,2,..., N} such that 
Up,(@i) > 7 and Up, (Aizu) > 7. Since aigu = fas) = f*(0) (mod aj), it 
follows that pz divides f“(0). Thus, we proved that for any r we can find 
u © {1,2,...,N} and k such that pt divides f¥(0). It follows that one of 
(0), f2(0),..., £4 (0) has infinitely many divisors and the claim is proved, 

If f'(0) = 0, then working with f! instead of f (the iterates of a under f* 
form a subsequence of the sequence (a,)n, and so will still have only finitely 
many prime divisors), we may assume that f(0) = 0 (it is easy to see that if 
f'(X) is of the form AX for some i > 1, then f itself is of this form). Write 
then f(X) = X@(bo + 6X ++--+6,X*) for some integers bo...., be and some 
d > 1 such that bob. 4 0. ‘Aoutais that the conclusion of the problem does 
not hold, so that e > 1. Note that a, divides a4). Since for each é there 
is nz such that a,, is a multiple of p;, the number dp, +...4ny is a multiple of 
pi‘+-py and so there is np such that for all n > ng we have py ---pylan. It 
follows easily from this that the prime factors of bo are all amongst the p;. 

Consider now some é such that p;|bo. Then 


Up (On4i) = dup, (An) + Up, (Bo + bran + +++ + beak) 
= Up, (Qn) + min(1, vp,(an)), 


from which is trivially follows that vp,(@_,) > oo for n — oo (we know that for 
large enough 7 we have vp,(an) > 0 by the previous paragraph). Now, we are 
finally ready to conchide: let p;,,...,pi, be those primes among p),...,pN 
that divide by. By the previous paragraph, eventually by increasing ng, we 
may assume that that vp, (an) > Un, (bp) for all 1 < 7 < k and all n > np. 
if ¢ is a prime factor of bg + --- + beat, with n > no, then q¢ divides ans1, 
so q is one of the p;’s. But all p;’s divide ap, so g must divide bp and so 
d € {pi,,..-, Pig }- Moreover, since Up, (an) > Upi, (bo) for all 1 < 7 < k, we 
have vp i (Bot ++ b,as) = pi, (89). Putting together these observations shows 


“ 
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that bo + bran + +++ + bea, < bo for all sufficiently large n, contradicting the 
fact that a, -> oo. The result finally follows. Qo 


10.5 Miscellaneous problems 


The following problem is a version of a very classical topic, known as the 
Prouhet-Tarry-Escott problem. It concerns disjoint sets of n integers having 
the same sum of dth powers for all 1 < d < k (k depending on n). 


Y 2. Let d,r be positive integers with d > 2. Prove that for any nonconstant 
polynomial f with real coefficients and of degree smaller than r, the 
numbers f(0), f(1),..., f(@ — 1) can be divided into d disjoint groups 
such that the sur of the elements of each group is the same. 


J. O. Shallit, AMM E 3032 
Proof. Define the sets 
Air) = {x € |0,d" - Ulsa(x) =i (mod d)}, 


where sq(x) is the sum of digits of « when written in base d. We will prove 
that sree Adt (ry f (x) is independent of i, for any polynomial f of degree smaller 
than r. The proof will be by induction on r, the case r = 1 being clear (since 
then each A,({r) has exactly one element and f must be constant). Assuming 
that the result holds for r, observe that. (by linearity) it is enough to prove 
that Se Aufrtt) 2 is independent of i in order to prove the inductive step. 
But, by definition (and considering the last digit in base d) we have 


Ai(r #1) = USE {de + fle © Ai_;(r)}, 


the union being disjoint (and the sets A; being numbered mod d). Thus 


d-1 
Sow] MS wes iy 


seEA(r+l) 9-0 2€Aj..3(r) 
di 
=e SM werd Swe. tPA 
m0 \) 2EApnj(r) @eA;.;(r) 
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Separate in the last sum the term corresponding to x” and observe that it does 
not depend on i, since it is just 


tS) 2". 


BEATA gn {PF} 


‘The other terms are independent of i by the induction hypothesis. Thus 
Leed(r+t)%” is also independent of %, finishing the proof of the inductive 
step. Cl 


Proof. This will use the same sets A;, but the proof that they satisfy the 
desired conditions is different. Let z be any dth root of unity different from 1 
and let 


Aaf(X) = F(X) + 2f(X +a) + PF (K + 2a) +--+ 2h F(X + (A Ya). 


Since 1+ 2+---+ 2%! = 0, we have deg(A,f) < deg(f). 
Let our disjoint sets be Ag, A1,...Ag—1 as in the previous solution and 
consider 


d-1 
A(z) = So 2k | > FX) 
k=O XGA 


By an immediate induction, we obtain that 
A(z) = Agr-1 gra ++ Ma F(X). 


But deg(f) <r, so that deg(Agr-1 Age—2 ++ AgAr f(X)) < 0 and so A(z) = 0, 
Since this holds for all such z, it follows that the polynomial A(t) is a multiple 
of L+¢+---+ 44! and hence Dine d, f(z) is independent of k. a 


The following problem is rather technical, but the idea is very simple: 
after some algebraic manipulations, everything comes down to the fact that a 
nonzero polynomial of degree at most d cannot vanish at more than d distinct 
points. 


10.5. Miscellaneous problems 51 


ot 


26. Let p > 5 be a prime and let a,b,c be integers such that p does not 
divide any of the numbers a ~ b,b — ¢,¢ — a. Let i,j,k be nonnegative 
integers such that i+ 7 +k is divisible by p — 1 and such that for all 
integers x, the number 


(# ~ a)(a ~ b)(x — ¢)|(a ~ a)"(a — bY (a — o)* ~ 1 


is divisible by p. Prove that each of i, j,k is divisible by p ~ 1. 
Kiran Kedlaya and Peter Shor, USA TST 2009 


Proof. We start with some formal reductions, First, note that we may assume 
that 0 < i,j,k < p— 1, as we can replace i,j,k with their remainders mod 
p—1, without affecting the hypothesis or the conclusion (this uses Fermat’s 
little theorem). We want to prove that i = j = k = 0, so assume the contrary. 
By hypothesis, i+ j + k = p—1 or 2(p~1). In the second case, replace each 
x € {i,j,k} with p~ 1-2. As this does not change the hypothesis or the 
conclusion, we can assume from now on that i+7+k = p—1. Finally, we can 
clearly assume that i is the largest among i, j, k. 
Multiplying the congruence 


(x — a)(a ~ 6)(x — ¢)|(a ~ a)'(x ~ be —c)* ~ 1] = 0 (mod p) 
by (x — a)/** and using Fermat's little theorem, we deduce that, 
f(x) = (x — a)(x ~ b)(x — &)[(a ~ 8)? (x ~ c)* — (a ~ a)+*] = 0 (mod p). 
for all integers 2. On the other hand, f has degree at most 


2(p=1) 

ee 
(for p 2 5) and p different roots mod p. Thus f vanishes in F,[X] and we 
deduce the equality (X — 6))(X — c)* = (X —a)** in F,[X]. Note that 
J+k#0,asi<p~landi+j+k=p—1. Thus (X - b)(X —c)* vanishes 
at & or c. But this is impossible, as by hypothesis (X — @)!+* does not vanish 
at either or e. 3 


B34 j4¢k-1<24 
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The next problem is very exotic and tricky. 


27. Prove the existence of a number c¢ > 0 with the following property: for 
any prime p, there are at most cp?/? positive integers n such that p 
divides n! + 1. 

Chinese TST 2009 


Proof. Of course, if pin! +1, then n <p—1. Let p> 2 and let 1 < ni < ng < 

>< 1% < p be all solutions of the equation n! = -1 (mod p). Assume that 
m > i (otherwise everything is clear), The congruences nj! = ~1 (mod p) 
and r:41! = —1 (mod p) imply that 


(ng +L) (rg + 2) ++ (ng + rig. — mi) = 1 (mod p). 
Letting k= ni41 — ny, we see that x = n; is a solution to 
(x + 1)(@ + 2)---(2 +k) = 1 (mod p). 


Since the polynomial (x + 1)(# + 2)---(a-4+k) ~ 16 (Z/pZ){x} has at most k 
distinct roots modulo p, it follows that for each 1 < & < p there are at most 
k indices ¢ such that niga — ng = k. We will prove that this is enough to force 
m < cpl, 

Choose a positive integer 7 such that 


G+)G +2) i+) 
5 . 


2 


>m> 


Since m > ia) = S71, J, when the differences nj,1 — ni are written in 
ascending order, the first is at least 1, the next two are at least 2, and so on, 
each time the next i differences are at least i (this is because for a fixed k, 
lsk<p, niyi — 1m =k has at most & solutions 7). Thus 


m~] +oy 

‘ 4 IG +1)(274+1 
Same PsP df= JEeevUna 
tl 


We deduce that 


(7+ 12 
p>lm—m% > IG 4+ DOQI+ DY 
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In particular, p > & and so 7 < (3p)1/3. 


follows. 


Since m < (j + 1)*, the result 


Before passing to the next problem, let us recall that Bézout’s lemma 
does not hold in Z[X]. However, if f and g are polynomials with integer 
coefficients and with no common complex root, then they are relatively prime 
and by Bézout’s lemma in QUX] there is a nonzero integer ¢ and polynomials 
A,B € Z[X] such that Af + Bg = c. Usually |e] > 1 and it is not at all 
clear what is the smallest possible value of jc], given f and g. The following 
problem discusses the case f = (X +1)" and g = X" + 1. It is a challenging 
problem, combining several ideas in a very tricky way. 


y¥ 28. Let n be an even positive integer. Find the least positive integer k for 
which one can find polynémials with integer coefficients f,g such that 


F(X\CK 4 1)" + g( XY XP 4D =H. 
IMO Shortlist 1996 


Proof. Let us write n = 2"-m for some odd integer m and assume that we 
have 
FXYK FD" + GX) AX" 4+ D = h 


for some f,g € Z[X) and some positive integer k. Taking for X a root 2; of 
the polynomial X?" +1, we deduce that Fei)(ze: +1)" = k. Multiplying all 
these relations and taking into account to []?_,(1 +z) = 2, it follows that 
This f(z) 2" = k?". Since []2, f(z) is an integer (by theorem 9.10), 2” 
divides k*" and so k must be a multiple of 2”. In particular, k > 2. 

We will prove now that k = 2” works. Let us see what happens when 
m = 1 first. We need to find polynomials f,g with integer coefficients such 
that 


F(X X +1)" + G(X X* +1) = 2. 
The idea is to find f such that 


fleet 1% =2 


518 Chapter 10. Arithmetic Properties of Polynomials 


for some root z of X2” +1. Indeed, since X2" +1 ig irreducible over the rational 
numbers (because (X + 1)?" +1 is Eisenstein for the prime 2), this would imply 
that X? +1 divides F(X)(X +1)” ~ 2, whieh would give us g. The key point 
is to take z = e# , because all the other roots % of X?" +1 are of the form 
2), with odd J. Thus, if 2; = 2,..., Zar are the roots of X?" +1, then we can, 
write 2; + 1 = (2 1)Q{z) for some polynomials Q; with integer coefficients. 
And since Ta + 2) = 2, it follows that (1 +2)?” Ean Qilz) = 2 which 
gives us the polynomial f and finishes the proof in the case rm <= 1, 
Finally, it is rather formal to deduce the general case f, 


rom the case m = 1, 
Namely, pick polynomials with integer coefficients f,g such that 


F(X) X + 1) 4g XX i) =2. 
Then 


FOR +" = (2 g( XX" 1)" = 2 4 (KF 4 1)A(X) 


for some h € Z{X]. The last equality 


follows from the binomial formula. Ni ow, 
replace X by X™ in the 


previous equality, to get. 
L(X™y™e Xm + 1% = gm + (x” + 1)Acx™) 


and observe that (X™ + 1)? = (xX + 1)"A(X) for some A € Z[X] (because m 
is odd). The conclusion is now clear. 0 


The idea of the 


following problem is very natural, but there 
technical details one 


are a lot of 
has to deal with, which makes the proof rath 


er long. 
<P 29. Suppose that J is a polynomial of 
coefficient and integer coefficient 
n such that f(n!) is composite. 


degree at least 2, with positive leading 
ts. Show that there are infinitely many. 


IMO Shortlist 2005 


Proof. We will try first to find prime numbers Pp and positive integers n such 
that pl f(n!). Then, we will ensure that n is large enough and finally we will 
have to get rid of the cases f(n!) = 0,p,-—p. Write 


F(X) = agX* + ag 4X? $e bag, 


519 
10.5. Miscellaneous problems 


with ag > 0 and d > 2. Note that we may assume that ap % 0, otherwise the 
problem is trivial. . eee 
, First, let us consider the equation f(n!) = 0 Kaenh aie i Paani 
is forces 3 for 2 = p — k wi . We have ti 
, this forces n <p. So, let us look ee 
nes first (p ~ k)! (mod p), which is very easy by Wilson’s theorem: 


(p~ pt 


p~k)Mp-k+1)-- (p=) 
= (p- k)(~1 4k ~1! (med p). 


-l= 


Thus, we have f(n!) = 0 (mod p) if and only if plz,, where 


2 = ag(k ~ 1)!" + ay(k — 11) b+ ag( 1) 


We will prove first the existence of large prime factors of xp, more precisely 
such that p > k. This is the content of the following 


Lemma 10.13. There exists ko such that for all k > ko, there exists a prime 
factor py of cy such that pp > k. 


Proof. This is easy: choose ky; such that vp((ky — 1) > ie gs arheed 
p < aq}. If all prime factors p of a, are less than k for some eee a y a 
(k— 1)! and zy, so they divide ag. But for such a prime P, Bint 2 ‘ ; ee 
Up(aa), we must have vp(x,) = vp(ag). We deduce that ial < soe a 
chisous ko > ky such that [ax] > |aq| for all k > ko, which is p e 
ay #0. 


Fix now kg and p, as in the lemma. Fix also a positive pore 
assume that none of the mea ; (nt) with mo cone can : _ iat carte 
i, we ssume that « > f(x!)—z is increasing +00). ; 
beri anats et Pr-k > N, then we must have f el ie 
and this will happen if we ensure that k,k-+1,....k + N—-—1 oo od a ‘ 
To have this, we can choose k = kg = a(N +1)! ae for t > Ee : ae 
La = Pk, ~ Ka, we deduce that f(xql) = zg +a(N + 1)! o- ‘4 is na Bi pe 
large a (so that kg > ko). In particular, the last relation shows that a, 


| 
| 
| 
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because the map a ++ zg is injective. In particular, for infinitely many a we 
have %g31 > 2%q_ +1 and so 


F(ta!) ~ ta + (N +1)! = Fast!) - casi > f((xe + DY) -@at+d. 
This implies that 
H(a+1)2) = f(a!) < 14 (N +1), 


* = fs . . 8) 
which is certainly impossible because Ate —> o6 for a 4 oo. Thus our 
Tal : 
assumption was wrong and at least one of the numbers f(x!) with n > N is 
composite. Since N was arbitrary, the conclusion follows. oO 


Remark 10.14. The hypothesis that ¢ > 2 was not used in the proof, so the 
result still holds for linear polynomials with positive leading coefficient. For 
instance, this also solves the following Chinese TST 2011 problem: prove that 
for all positive integers d there are infinitely many n such that d- nl —1 is 
composite. 
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25), Qiaochu Yuan (problem 1), Victor Wang (problem 14), Gjergji Zaimi 
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Chapter 11 


Lagrange Interpolation 
Formula 


It is a standard fact that a nonzero polynomial f with coefficients in a 
field K has at most deg f roots in K. Lagrange’s interpolation formula is in 
some sense a refinement of this standard result: it makes precise the way in 
which deg f + 1 points of K determine the polynomial f. 


Theorem 11.1. Let K be @ field and let ro,21,...,¢n be distinct elements of 
K. Then for any polynomial f € K{X] of degree at most n we have 


$00) = few) TT A. 


LE 
k=0 jek * 


Proof. Simply note that the difference between the polynomials in both sides 
of the desired equality is a polynomial of degree at most n which vanishes at 
2G, 1,...,Ln, thus it is the zero polynomial. Cc 


The following corollary is an immediate consequence of the previous the- 
orem, but it is rather useful in practice, especially when proving complicated 
identities. 
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Corollary 11.2, Let f € K(X] be « polynomial of degree at most n and let 
@G)£1,-.+,%n © K be distinct elements. Then 


n 


k=O Lae (te — 3) 


is 0 of deg f <n and equals the leading coefficient of f otherwise. 


Proof. Simply identify the coefficients of X” in Lagrange’s identity. 


The first two problems are direct applications of Lagrange’s interpolation 
formula. 


T 1. A polynomial p of degree n satisfies p(k) = 2* for all 0 <k <n. Find 
its value at n+ 1. 


Murray Klamkin 


Proof. Using theorem 11.1, we obtain 


p(n +1) 


H 


“$F (e 1r-Fg 
k= 
= rth 4, 


0 


Remark 11.3. There is also a neat solution without use of the interpolation 
formula: consider the polynomial 


po (2+ (2) 4-4(%) 


It has degree n and satisfies f(k) = 2* for all 0 < k < n by the binomial 
formula. ‘Thus we must have f = p and then clearly p(n+1) = 2"+!—1, Even 
though very neat, this solution is not so conceptual. 
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2. A polynomial f of degree n satisfies 


for lO <k <n. Find f(n+1). 
Titu Andreescu, IMO Shortlist 1981 


Proof. This is also immediate using theorem 11.1: 


f(n+1) = a Nar 


jek 


k=Q \ k 


= Sep 
k=O 


(4 
= ie 


3. Prove that for any real number @ we have the following identity 
nm 
S\(- Bue )e- ky =n! 
k=0 
Tepper’s identity 


Proof. Use corollary 11.2 for the polynomial P(X) = (a-— X)" and the points 
tp = k. Cj 


4. Let o pes 


= eed eee 
* os | eT [[(e an aj) 
jek 


Prove that Sy = xp + 41 -+---+ a, and compute So. 
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Proof. Actually, the following method shows how to compute Sp in general. 
‘The idea, is to consider the remainder f(X) of X"+? when divided by 


TI - 2) 
g=0 


and to apply Lagrange’s interpolation formula to it. Identifying leading coef- 
ficients and using the fact that f(a;) = opt, we deduce that Sy is precisely 
the leading coefficient of {(X). Now, for p= 1 we clearly have 


f(X) = x") ~ T] (x - 2;), 


j=0 


so that its leading coefficient is Sho 7 On the other hand, for p = 2 we 


must have 
n 


X28 = Q(X) TTX — 23) + FX) 


joo 

for some polynomial Q. Comparing degrees and leading coefficients shows that 

Q(X) =X + for some constant c. To determine c, we impose the condition 

that deg(f) <n. This implies that ¢ = a0 aj. Then, it is easy to find 
, 2 

the coefficient of X" in f(X) and the answer is (5 z;) — ocicjen THE} 

Actually, we leave as a nice exercise for the reader to prove that for all p we 


have 
S= aftap-. ate, 


a +a2t tan =p 


where in the sum above the a; are nonnegative integers. 


The following problems deal with extremal properties of polynomials. The 
underlying philosophy is that imposing conditions on the values of a polyno- 
mial at sufficiently many points of an interval automatically imposes conditions 
at all other values. Lagrange’s interpolation formula is a very handy tool in 
such situations, but there is an extra ingredient which appears all the time, 
namely the Chebyshev polynomials. Recall that the nth Chebyshev polyno- 
mial T,, is the unique polynomial of degree n such that cos(nz) = T,(cos x) 
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for all x € R, It is not really obvious that T;, exists, but the reader can 
easily check the existence inductively, by establishing the recurrence relation 
Trei(X) + Tri(X) = 2XT,(X). This also implies that the leading coeffi- 
cient of T;, is 2"~!. A fundamental theorem of Chebyshev states that for any 
monic polynomial f € REX] of degree n we have max,e(-1,4|f(#)| 2 get: 
with equality if and only if f tester Tn. This explains why Chebyshev poly- 
nomials are so important. For a proof of Chebyshev’s theorem, see problem 
14 in this chapter. 


5. Let a,b,c be real numbers and let f(z) = ax? + bx +c be such that 


max{}/(41)|, (f()} < 1. 


i) L, 1 
Prove that if jz| < 1 then [f(z)l < ; and jes G) <2: 
Spain 1996 
Proof. Using Lagrange interpolation, we can write 
X?4X X?~X 


£(X) = FOX?) + fQ—— + f-)-— 


We deduce that for all jz| < 1 we have 


f(a) <1 — 


the last inequality being equivalent to ({2| — 3) = 0. Similarly, we find that 


je f(i/e)i 1-24 Lhe ot We co, a 
2 2 


6. Find the maximal value of the expression a? +b? +-c? if Jax? +ba+e| <1 
for all a © [--1, 1]. 
Laurentiu Panaitopol 
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Proof. Define 
A=f(), B= s(0), C= f(-1). 


Then we easily obtain 


Therefore, an immediate computation gives 
2,42, 9 A240 
+ tee DT 2 
. 5 + 2B° ~ BLA + C+). 
ee iA], IBI, IC] = 1, the last expression is ele, 
the obvious estimate for each term of it). Thus. 
To see that it is optimal ch 


arly bounded above by 5 (use 
c we always have q?4. Pre <5 
» Simply take Chebyshey’s polynomial 2X? . L ~ O 
?. Define F(a, b,c) = ja? ; ) | 
> (a,b,c) = max |e? — g72 ” 
a he" ~ az ~ be — el. What is the least possible 
value of this function over R3? . 


Chinese TST 200 
Proof. Let Pate(X) = X38 gx? bx 
[0,3] to [-1, 1] via 
theorem in order t 


~c, The idea is to m, i 
— ; 2a, ap the interval 
ffine map and then to use Chebyshey’s least deviation 


0 bi 
ound from below MAX <<19,3] |Pa,b,c()]. Note that 


Prpe ( sD) . 


is i thi 
a polynomial of third degree with leading coefficient 
eorem gives us the estimate 


max, [Pare ( ee) 


{1,1 


Bt [Pabe(a)| = Pore 


Since Pare sO) 
27/8, Chebyshey’s th 


Thus ) = # and this j 
is F(a, b,c) > 37 and this is optimal, since equality holds for 


5 
Paie = 3p 3(2X/3 ~1), 
where T3(X) = 4X3 3x 
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Remark 11.4. For third degree polynomials, Chebyshev’s theorem is very easy 
to prove using Lagrange’s interpolation formula: if f is a monic polynomial of 
third degree, identifying leading coefficients in Lagrange’s formula yields the 
equality 
 2fQ)  Af(l/2) | 4f(-1/2) _ 2f(-1) 
3 3 : 3 3 
Of course, this can also be trivially checked by hand. This equality and the 
triangle inequality imply that max,ej_11)|f(#)| 2 + for any such f, with 
equality when f(X) = X? — 8x. 
8. Let a,b,c,d € R such that lax* + bx? + cx + dj < 1 for all « € [1,1]. 
What is the maximal value of |ci? For which polynomials is the maximum 
attained? 


1 


Gabriel Dospinescu 


Proof. Choose distinct numbers 2, 21,%2,23 and identify the coefficients of 
X in Lagrange’s formula 


3 
4 AS se 
aX? +X? +X +d= 7 (ae) [] 7 
k=0 ae OR 


We deduce that 
+ tetqt £301 | 


Lr 
lel = Decree ~ a1) (to ~ £2)(&o ~ £2) | 


Lz + Lewy + T3L1 
(xq — 41) (a0 (to ~ #3) 


The problem is to find a 4-tuple (xo,21,22,23) which minimizes the last ex- 
pression. As usual in this kind of problems, a good idea is to choose the points 
where |73(x)| takes its maximal value 1 on the interval [—1, 1]. These are the 
points zg = —llay = ~4,x9 = $ and <3 = 1. It is easy to compute the last 
sum in this case and we find that |c! < 3. Since this value is attained for 
the polynomial T3(X) = 4X° ~ 3X, this is the maximal value. Also, it is not 
difficult to check that equality appears in the above chain of inequalities only 
for T3 and —T3. O 


{ 
} 
| 
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9, Ifa polynomial f € R[X] of degree n satisfies | f(x)| < 1 for all x € [0, 1], 


then | 1 
b(-a) 


Proof. We will use Lagrange’s interpolation formula at the points k/n for 


O<k<n. We have 
(2) <8? 


gogntl = 4. 


which shows that 


PCDSMe Lene 


k=O jth | k=0 


10. Let a > 3 be a real number and let p be a real polynomial of degree n. 
Prove that 
é la’ — p(a}i > 1. 
nO ned a! p@ 21 


Proof..The crucial observation is that we have 


eae ‘)a=a 


pa 

This is simply Lagrange’s interpolation formula written in a slightly changed 
form (though a more conceptual way of seeing this is using the finite differences 
theory). We deduce from this and the binomial formula that 


atl . 
(o— yt = ("2 ) 00) ~ a 


k=O e 
Thus, if [p(k) ~ a*| < 1 for all O< k <n +1, we must have 


la- yet 25 (" + a ex grtl 
k > 


k= 


contradicting the fact that @ > 3. This finishes the sohition. O 
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Remark 11,5. ‘The identity 


for polynomials of degree at most n is extremely useful. We proved it here 
as a consequence of Lagrange’s interpolation formula, but it also follows from 
the theory of finite differences, for a glimpse of which we refer the reader to 
section 10.3, 


11. Let a,6,c,d € R such that |ax® + bz? + cx + dl < 1 for all x € [-1, 1]. 
Prove that 
lal + |b] + fe] + ld < 7. 


IMO Shortlist 1996 


Proof. Let us look at the values of P(X) = aX3 + bX? + cX + at 
~1,-1/2,1/2,1, which are the classical interpolation points in Chebyshev’s 
theorem (for n = 3), Writing 


A=f(Q), B= fQ/2), C= f(-1/2), D=f(-), 


we can easily express a,b,¢c,d in terms of A,B,C,D. An easy computation 
gives 


a +4 D) 58 ©), b= H(A+D)-2(B+0), 
and 
1 4 1 2 
c=-HA-D)+3(B~C), d=-F(A+D)+ (B40). 


This shows that f(a, b,c,d) = jai + |b] + |c] + {dj is actually a convex function 
of A,B,C, D & [~1,1]. Thus it attains its maximum when all A,B,C, D are 
equal to 1 or —1. Now, it is a tedious matter to check that in all cases the 
expression is at most 7. We have equality for the Chebyshev polynomial (or 
its opposite) of degree 3. 
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12. Let A= {re RIX] degp < 3, |p(+1)} <1, (2 a) = i} 
Find sup max |p"(x)|. 
pe |2|S1 
IMC 1998 


Proof. Write p(X) = aX° + bX? +cX +d. Since p"(r) is an. affine function 
on [1,1], the maximum of its absolute value is obtained for 2 = 1 or ¢ = —1, 
Thus, we need to bound in an optimal way the number 


max(|6a + 2b],| — 6a + 28)). 


Taking the Chebyshev polynomial shows that this can be already as large as 
24. But it is always at most 24 (under the given hypothesis on p), since for 
instance using the forrnulas in the solution of problem 11 yields 


16 8 28 20 
6a + 2b = pl) — sp(-1) — Sa(1/2) + | p(-1/2). 


Thus {6a + 2b] < 24 by the triangle inequality. We proceed in a similar way 
to bound | — 6a + 26] or we can observe that if p satisfies the conditions in the 
problem, so does p(-—-2). 


Y 13. Let n > 3 and let f,g € R[X] be polynomials such that the points 
(f(1), 911), (F(2), 9(2)),..-, (f(r), g(m)) are the vertices of a regular n- 
gon in counterclockwise order. Prove that max(deg f,deg g) > nm — 1. 


Putnam 2008 


Proof. It is enough to prove that P(X) = f(X) + ig(X) has degree at least 
n—1. Clearly, we may assume that the regular n-gon has vertices z,2?7,...,27 
forz =e (simply apply a translation and a homothety to reduce the general 
case to this one). So, it remains to prove that if a polynomial P satisfies 
P(t) = # for all 1 <i <n, then deg P > n—1. Assume that this is not the 
case. Using Lagrange’s interpolation formula, we obtain 


“Se TEE 


im? i aa 


aon $ Somired” (8) 
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Dividing both sides by z and using the binomial formula yields (1—z)"~! = 0, 
a contradiction. 


Remark 11.6. There are many other approaches. The most efficient is the 
method of finite differences, which yields 


A 1 pr neewe y("y ‘Pej ~j)=2(2~ yr 40. 


jd 


Another very neat proof considers the polynomial P(X +1) ~ zP(X). It has 
degree at most n — 2, it is clearly nonzero and vanishes at 1,2,...,2~1,a 
contradiction. 

We leave the land of routine problems in the Lagrange interpolation for- 
mula and tackle some more delicate results. The following is taken from [66]. 


14. Let f be a complex polynomial of degree n such that {f(x)| < 1 for all 
€ [1,1]. Prove that [f(x)} < ) (z)| for all k and all real numbers 
z such that |x| > 1. Deduce Chebyshev’s theorem: if f is monic of degree 


n, then 
at) IF@) 2 seaz- 
W.W. Rogosinski 
Proof. Pick for the moment n+ 1 points z,.... zy in [—1,1] and apply La- 


grange’s interpolation formula: 


y= S240) Bek. 


i= Pies alae! | 


Next, differentiate this equality & times, so if we set P;(X) = TialX ~ ay), 
then, 


f¢ % 
f(a) >> 4 aan P(r) 
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for all x. Choose any |x| > 1 and apply the triangle inequality to the previous 
identity, together with the hypothesis that |f(2,;)| <1. We infer that 


But we also have 


so it would be really nice if we could ensure that 


ae nee | 5 PO) 


(ai) | [Riel 


In order to have this, we already need {T,,( me = 1 for all i, which suggests 
taking the old friends 2; = cos (2) for 0 <i <n. Then Ty(ai) = (~1)' and 


we would like to prove that the numbers (~1)° - ao have the same sign. 
If this were pea we would be done by the previous arguments. Note that 
fo > £1 > +++ > a, so that 


Pi(ai) = (wi — x0) (i — €1) +++ (84 — Bi-1)(@i — Bin) +++ (Gi ~ Bn) 

has the sign (~1)*. So, it remains to see if the signs of the numbers ph) (x) 
are independent of i as long as |x| > 1. The crucial remark is that Pp have 
all their roots in [-1, 1]. Indeed, this follows from Rolle’s theorem and the 
fact that P; have all their roots in [--1,1]. But then it is clear that the sign 
of PM (a) only depends on the degree of p®) (which is independent of i) and 
on the position of x with respect to 1 (ie whether « > 1 or ¢ < —1). So, it is 
independent of 7 and we are done. 

Finally, let us prove Chebyshev’s theorem using this result. Suppose that 
\f(x)p < sty for all  € [—1,1]. By compactness there exists ¢ > 2"~' such 
that jef(«)| < 1 for all x € [-1,1]. Using the result established above, we 
deduce that for |z| > 1 we have |ef@(x)| < [7 (2)|. As f is monic of 
degree n and since T;, has degree n and leading coefficient 2"~, this yields 
¢ <2"! a contradiction. Oo 
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¥ 15. The polynomials f,g € R[X] and a € R[X, Y] satisfy 
f(x) ~ f(y) = ala, v)(9(@) — 9(y)) 
for all z,y. Prove that we can find a polynomial h such that 
f(z) = h(g(@)). 
Russia 2004 


Proof. Fix an integer N > deg f. If g is constant, so is f and everything is 
clear. So, assume that g is not constant, thus we can choose N +1 distinct 
numbers %o,%1,-..,N at which g takes different values. By Lagrange’s inter- 
polation formula, there exists a polynomial # of degree at most N such that 
A(g(a;)) = f(a) for all 0 <i< N. Note that by hypothesis 


F(X) — F(a) = aX, 24)(9(X) — o(@i)), 


so that g(X) ~ g(a;) divides f(X) ~ f(z,). On the other hand, g(X) ~ g(a) 
also divides h(g(X)) ~ h(g(xi}) = A(g{X)) — f(z:) (this follows by linearity 
and the fact that g(X) — g(x;) divides g(X)* — g{a;)* for all k > 1). Thus 
A(g(X)) — f(X) is a multiple of g(X) — g(a). Now, the polynomials g(X) ~ 
g(x;) are clearly pairwise relatively prime, since the numbers g(z;) are pairwise 
distinct., Thus A(g(X)) — f(X) is a multiple of TX) — g(x;)), but has 
degree smaller than that of TI o(o(X ) ~ 9(zi)). Therefore f(X) = A(g(X)) 
and we are done. o 


The next problem presents P.J. O’Hara’s beautiful proof [63] of a famous 
inequality of Bernstein. 


16. a) Prove that for all complex polynomials f having degree at most n 
and for all « € C 


ef'e) = 31) +5 Dien aa 


where z, are the roots of the polynomial X” + 1. 
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b) Deduce Bernstein’s inequality: for all complex polynomials. f we 
have |[f"| < nllfll, where |{f|| = max |F(2)I- 


P.J. O'Hara 
Proof. a} The polynomial 


~ f(z) 


9x(X) = aa 


has degree at most n—1 and it is easy to see that g,(1) = 2f’(x). Lagrange’s 
interpolation formula for the points z; yields 


n 
Go(X) = S gel) ye 
ram fe 4 


which combined with the equality! 
Tl@i-a)=ntt=-2 
a#t 


gives 
X"4+1 
a — xX 


ge(X) = — J nige(2) 
i=] 


Taking X = 1 and remembering that g,(1) = zf'(z), we obtain 


n 


"(q) = 4 ) = fle) 
fe) = 5 see) — SO ae 
so we only need to prove that 
5 24 7 
oy (1-a)F 2° 


‘Obtained by differentiating the identity X" + 1= Tx — 2:) and taking X = z. 


oy 


qr 
we 
ar 


Simply choose f(X) = X” in the equality 


of e) = LY“(Ulex) - fag 
i=1 : 


b) By scaling f, we may assume that ||f}] = 1. Choose any |z{ = 1 and 
use the identity in (a) to deduce that. 


’ n ig 22 
OSS +5 la we ay 


The miracle is that all numbers 
27." 1 i 


(1 2)? i (& 7 2) 1. Re(m)—1 


are actually negative, so the previous inequality becomes 


n 2tp, 
If@l<5-- <3 
2 nts (1 — 2g) 
Since we’ve already seen that 
imt (1-2)? 2 


we can now safely conclude that [f’(z)| <n, finishing the proof of this hard 
problem. 


We end this chapter with a beautiful inequality due to Gelfond, who used 
it in his solution of Hilbert’s seventh problem. This famous problem asked to 
prove that a” is transcendental whenever b is an algebraic irrational number 
and a is an algebraic number different from 0 and 1. 


17. Let f be a complex polynomial of degree at most n and let 20, 21,-.., 24 
be the zeros of the polynomial X¢+! .. 1, where d > n. Define 


IIFll = max LF(2)]- 
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a) Prove that if there exist n + 1 pairwise distinct numbers 
fo, £1,-..,%, among 2, 21,.-.,2q such that [f(a;s)| < 3) then 
WAll <1. 

b) Deduce that 

I[Fll -{lgtl s 488 +4e8 1) Fglp. 


A.O. Gelfond 


Proof. a) The first step is rather clear: Lagrange's interpolation formula for 
the points 29, 21,...,2n yields 


vombining this with the triangle inequality and the hypothesis, we obtain for 
jz|=1 


We brutally bound each |2 ~ x,| by 2, so that [],4;|2z — 2,| < 2". The subtle 
part is to deal with ]],.; |x; — aj]. Write 

{yn.---: Yan} = {20)..-, 24} — {%0,-.., a} 
and observe that 


d=—n 


Tie 2)-Tle-vl={ 1] (aj — 2;)) =d+1, 
kus 


oat 25 PR, 


because []_,..,(#i ~ 2;) is exactly the derivative of X4*! —1 at 2;, which has 
absolute value d+ 1. Since [x; — yx | < 2, we infer that 


11.1. Notes 537 


Allin all, each term [] 54; eY can be bounded from above by 


an ad 
gH d+ 


Since there are n+ 1 terms and n+ 1 < d+ 1, we deduce that for all jz| = 1 
we have |f(z)| < 1 and so [/f|| <1. 

b) This part is an easy consequence of the first one. Scaling f and g 
allows us to assume that ||f|] = [[gl| = 1. Assuming that ||fgl| < 2, we 
deduce that for any 0 < i < d we have |f(z:)| < ds or |g(zi)| < ao With 
d = deg(f) + deg(g), it follows that either we can find at least deg(f) + 1 
2'8 such that |f(z)| < He or we can find at least deg(g) + 1 z’s for which 


ig(zi)| < re Both cases are excluded by part a), and the result follows. 


11.1 Notes 


We would like to thank the following people for providing solutions to the 
following problems: Xiangyi Huang (problems 6, 7, 8, 9, 12), Qiaochu Yuan 
(problems 1, 3, 10). 


Chapter 12 


Higher Algebra in 
Combinatorics 


This last chapter deals with applications of linear algebra in combina- 
torics. There is a huge literature on this subject, which is still very active, so 
all we can do in this chapter is to barely scratch the surface and present the 
reader with some interesting elementary examples. Of course, this does not 
mean that there aren’t a lot of difficult problems in this chapter. 

Before passing to problems, let us recall some basic results of linear al- 
gebra. Let K be a field. A K-vector space V is an abelian group endowed 
with an external multiplication K x V - V, denoted (a,v) ++ a-v, which is 
compatible with the additive operation of V and with the operations in K. A 
subspace of V is an additive subgroup which is stable under multiplication by 
elements of K. A family (v;);e1 of elements of V is called linearly independent 
if for all finite subsets SC J and all (a;)ies € K'*!, the relation Sy,25 5-05 = 0 
forces a, = 0 for all s € S. A family (v,)iey is called generating if any vector 
v€V can be written v= Les Gg Us for some finite subset S ¢ I and some 
a, € K. Finally, a basis of V is a linearly independent family which is si- 
multaneously generating. Using Zorn’s lemma, one can prove that any vector 
space has a basis and that any linearly independent family of vectors can be 
extended to a basis. In most of the following problems we will only consider 
finite dimensional spaces, i.e. vector spaces having a finite generating family. 
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The dimension of such a space is by definition the number of elements of a 
basis (it is not obvious, but it can be proved that this does not depend on the 
choice of the basis). 

If V and W are vector spaces over K, a map f : V + W is called linear if 
F(v1 +02) = f(w1)-+ f(va) for all v1, 02 € V and f(a-v) = a-f(v) for allueV 
and a € K. The kernel of f is then the subspace Ker(f) = {v € V[f(v) = 0} 
of V. A fundamental result in linear algebra is the following formula 


dim V = dim Ker(f) + dim Im(f). 


Traditionally, dim Im(f) is called the rank of f. 

Let f: V - V bea linear map, where V is a K-vector space of dimension 
n. If €1,€2,...,€n is a basis of V, the matrix of f in this basis is defined by the 
equalities f(e;) = 74, ajie;. Tt is easy to check that the matrix associated 
to the composition of f and g is simply the product of the matrices associated 
to f and g. Conversely, any matrix can be naturally seen as the matrix of an 
endomorphism of V in the given basis. 

If A is an n x n matrix with entries in some commutative ring R, its 
determinant is defined by 


det A= S$” e(o)aro(1ya20(2) *** Gna(n)s 
oESy 


Sn being the group of permutations of {1,2,...,n} and e(c) being the sig- 
nature of ¢. The most important (and rather surprising!) property of the 
determinant is its multiplicativity, ie. det(AB) = det A- det B. Another im- 
portant property is that A is invertible in M,(R) if and only if det A is a unit of 
R. Thus, if £ is a field, we can test if a matrix is invertible by testing whether 
its determinant is nonzero and this happens if and only if its associated endo- 
morphism is bijective. Let M,(K) be the space of n x n matrices with entries 
in K. If A € M,(K), its characteristic polynomial is det(X - J, — A), a monic 
polynomial of pike n with coefficients in K. Its roots in an algebraic closure 
K of K are called the eigenvalues of A. Hence, if \ is an eigenvalue of A, then 
we can find v € K” such that A-v = X-4v, ie. DH fai Gij¥j = A- v; for all 

. It is not difficult to check that the sum of the eigenvalues of A (counted 
ah multiplicities) is equal to the trace of A, which is by definition the sum 
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of the diagonal elements of A. Also, det A is the product of the eigenvalues of 
A (again counted with multiplicities). It is a little bit trickier to prove that 
the eigenvalues of g(A) (where g € K[X]) are g(A1), 902), -.-, gn). 


12.1 The determinant trick 


We present in this section a few tricky combinatorial problems in which 
the determinant of a matrix plays an important role. 


1. Let n > 2. Find the greatest p such that for all k € {1,2,...,p} we have 


5 Exo) -E Ee), 


oEAy \iel j=] 
where A,,B, are the sets of all even and odd permutations of the set 
{1,2,...,n} respectively, 
Gabriel Dospinescu 
Proof. The first ingredient is the following: 


Lemma 12.1. Let a), 42,-..,dm and 61, 69,...,bm be positive integers and let 
N be a positive integer. Then 


ak + ak +... +ak = bk + ok +--+ Be, 
for ll <k< N if and only if (X —1)%+ divides 
XM 4 X84... 4 KO — XO Xba Xm, 
Proof. This is quite easy: if 
P(X) = XM NM a KOM — dr ba _ Xm, 
then (X ~1)%+? divides f if and only if f(1) = 0 for all 0 <j < N. This 
happens if and only if 


Satay hata -j+)l= Shi ~1)-++ (8: -F+1) 
i=] 


j=1 
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for all 0 <7 < N. As the polynomials X(X 1) (X 5 4 1) for 1 < 
j <N span the same vector space as the polynomials X,X?,...,X% (this is 
immediate), the previous equality holds if and only if 

af tah toes pak = OF + OF pe $ ak, 
for all 1 << k <.N. The conclusion follows. a 


Using the previous result, we deduce that p+ 1 is the multiplicity of 1 as 
a root of the polynomial 


A) = YO x8 — x80), 


oEAR oe By 


F(X) = YF e(o) x7). x22)... 90) — get ay, 


oESn 


Ax)=x™P. TT xs x4 


Isi<jcn 


and since the multiplicity of 1 in X/ — X? is 1 for all i 4 7, we deduce that 
p+1= (§). Thus the answer to the problem is (3) — 1. a 


The next problem is rather strange, both because of the statement and 
because of one of its proofs. We also present a more natural proof, based on 
the inclusion-exclusion principle. : 


2. For a permutation o of {1,2,...,2} let e(o) = 1 if o is even and ~1 
otherwise. Let f(c) be the number of fixed points of ¢. Prove that 


ao) oan nm 
Daag 7 OO sa 
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where the sum is taken over all permutations o of {1,2,....n}. 
Putnam 2005 


Proof. We start with the exotic proof: observe that 


Cee ae [ tore ['( devel) ae 
TF Hay = ef Mpa o 
We recognize the determinant of the matrix A(z),j = z=) under the integral. 
But det(A(z)) can also be computed using row-column operations: add all 
columns to the first one, take out the common factor «+n —1 and then 
subtract the first column (which consists now only of ones) from all the others, 
to make the elements in the first row equal to 0, except for the first one. Next, 
expand the determinant using the first row. We obtain in this way that 


det A(w) = (@+n~1)(@~ 1)". 


Thus 


0 
-[ (a+ n)a" ld 
ay 


—yyrti 
Sar wecls 


= (-ay"t? 2 


n+l? 


which is precisely the desired result. O 


Proof. Here is a more natural proof: we will partition the permutations o 
according to their fixed points. For a permutation o, let Fix(o) be the set of 


its fixed points and let f(a) = [Fix(c)|. As 


eo) wil 
Dia) ed OP 


o k=0 lAl=k \Fixto)=A 
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we naturally try to understand F(A) = Urixta= Aaé(a). We will compute 
F(A) using the inclusion-exclusion principle, according to which 


P(A) = SU (PIS FC). 


ACB BCC 


On the other hand, 


S$" F(C)= SY elo). 


BCC o|B=1 


The notation o|B = 1 means that o() = 6 for all b € B. Now, permutations 

such that o|B = 1 correspond to permutations of the complement of B, so 

if n —|B| > 2 we have Dojpaie(7) = 0. We deduce that pce F(C) = 0 

unless |B] > 1 ~ 1, when it is equal to 1. Combining this with the previous 

formula (inclusion-exclusion principle) yields F(A) = (—1)""/4I(|Aj + 1 n). 
We deduce that 


_pnek{M\K+1L~2 
cu) ear 
=. 1 n 
~ ——(, }(-1)""* 
nad)! ) 


and the result follows immediately. o 


The next problem crucially uses the multiplicativity of the determinant. 
It is (rather amusingly) a consequence of the irrationality of V5. 


3. Is there in the plane a configuration of 22 circles and 22 points on their 
union (the union of their circumferences) such that any circle contains 
at least 7 points and any point belongs to at least 7 circles? 


Gabriel Dospinescu, Moldavian TST 2004 
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Proof. The answer is negative. First, we will use the standard trick of counting 
pairs (P,{Ci,C2}), where C,C2 are distinct circles among the 22 ones and 
P is a point among the 22 ones that belongs to Ci M C2. Each point belongs 
to at least 7 circles, so for each P there are at least (6) sets {C1,C2} living 
in a triple with P. So the number of triples is at least 22 - (). On the other 
hand, for each {C}, C2}, CM C2 has at most 2 elements, so there are at most 
2- (7?) triples. The miracle is that we actually have 2- (2) = 22- (2). Thus, 
all previous inequalities are actually equalities. 

Let P,, Po,..., Pag be the points and let Cy, C2,...,C22 be the circles. 
Consider now the matrix A whose (i, j)-entry is 1 if P; € C; and 0 otherwise. 
The previous paragraph shows that P - Pt is the matrix whose elements on 
the main diagonal are equal to 7, all other elements being equal to 2. An easy 
computation shows that the determinant of this matrix is 49-5}. But this 
determinant is also equal to (det A)’, which is a perfect square. We deduce 
that 5 is a perfect. square, which is a bit difficult to make happen... q 


We end this section with another rather challenging problem on permu- 
tations. 


4, A permutation o of {1,2,...,} is called k-limited if |o(Z) — i] < k for 
all 1 <%i< n. Prove that the number of k-limited permutations of 
{1,2,...,n} is odd if and only if n = 0,1 (mod 2k + 1). 


Putnam 2008 


Proof. Let M be the n x n matrix defined by Mij = 1y.j\<e. Tt is clear that 
the number of k-limited permutations has the same parity as det M. So the 
question is when M is invertible in M,(F2). From now on, we always work in 
F,. Let us prove that if n = 2,...,2k (mod 2k +1), then M is not invertible. 
Pick integers 0 < a@ < b < k such that n+a+ 6 = 0 (mod 2k +1) (it is 
clear that they exist under the assumption that n = 2,...,2k (mod 2k + 1)) 
and set j = (n +a + b)/(2k +1). If r; is the /th row of M, then the vector 
all of whose components are 1 can be written as pay Thpt—at(2k+1)i aud as 


Srat Tr1—b+(2k+1)i- Hence there is a nontrivial combination of the rows of M 
which yields the zero vector. Thus M is not invertible in this case. 
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Assume now that n = 0,1 (mod 2k+1) and that a1, 42,...,@, are scalars 
such that airj+agr2+++++anrn is the zero vector. Put a; = Oifi ¢ {1,...,n}. 
Then @mk + m—k+1 + +++ + Gm4e = 0 for all m. By comparing the relations 
for m and m+ 1, we obtain @m_~ = @m4iqi for 1 < m <n, so a; repeat 
with period 2k + 1. Taking m= 1,...,% further yields the equalities a,.2 = 
= Gong, = 0. Taking m = n~k,...,2—1 gives another chain of equalities 
Gnu 2k 0+ = Oni = 0. If n = 0 (mod 2k + 1), this can be rewritten as 


a, = +++ = a; = 0, whereas for n = 1 (mod 2k +1), it can be rewritten as 
aq = +++ = Gx41 = 0. In either case, since we also have ay + +--+ + agp41 = 0, 
we deduce that, all of the a; must be zero. The conclusion follows. Ql 


12.2 Matrices over Fy 


The reduction map Z — Z/2Z induces a natural map M,(Z) -> 
M,, (Z/2Z), which is easily seen to be a ring homomorphism. This map is 
very useful when dealing with applications of linear algebra in combinatorics, 
since for a lot of problems the parity gives already enough information. Also, 
the incidence matrices of families of sets are binary matrices and so are the 
adjacency matrices of graphs. To simplify notation, we let Fy = Z/2Z. 


p 5. 2n+ 1 real numbers have the property that no matter how we eliminate 

one of them, the rest can be divided into two groups of n numbers, the 

sum of the numbers in the two groups being the same. Prove that the 
numbers are equal, 


Proof. Wet x1, £2,...,2n41 be real numbers as in the problem and let X be 
the column vector whose coordinates are the x;’s. We can write the hypothesis 
in the form AX = 0 for some matrix A = (a,j) with aj; € {-1,0,1}, ai = 0 
and such that the sum of the numbers in each row of the matrix (aij) is zero. 
Of course, this linear system of equations has the trivial solution 2} = #2 = 

+ = Zen41 and the problem asks to prove that this is the only solution. 
Now, the dimension of the vector space of solutions is dimKerA, which is 
also 2n + 1~rkA. Thus, it is enough to prove that A has rank 2n. Of 
course, the rank is at most 2n, since the sum of the elements in each line is 
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0. But if we see A as a matrix over F2, A becomes the matrix with 0 on 
the diagonal and 1 elsewhere. Since 2n + 1 is odd, it is easy to check that 
this matrix has rank 2n (the associated system of equations over F2 is simply 
Ta+ +++ +241 = O,...,01 +++ +29, = 0, which clearly has the only solution 
y= €2 = -++ = Lan4i). But then A must have rank at least 2n, since the rank 
can only decrease after reduction mod 2 (this is simply saying that a nonzero 
element of Fp lifts to an odd, thus nonzero, integer). The result follows. 


Proof. The classical proof is done in three steps. Let x1,...,¢2n41 be the 
numbers in the problem. In the first step we assume that all 2; are integers. 
The hypothesis clearly implies that all x; have the same parity as 7; + #2 + 
++++ on21. Thus all 2; have the same parity. Writing them either as 2y; or 
as 2y; + 1, it is clear that y;,.-.,yon41 satisfy the same property as the z;’s. 
Since the sum of the absolute values of the y;’s is smaller than that of the «,’s, 
after finitely many steps deduce that all 2; are equal (because if the process 
continues forever then all the x; must have been Q). 

The second step treats the case when the «;'s are rational numbers. Multi- 
plying them by a suitable positive integer N to make them integers, we reduce 
trivially this case to the one considered above. 

Finally, we consider the general case when the 2,’s are real numbers. Let 
€1,€2,..-,€% be a basis of the Q-vector space generated by the z;’s. Write 
each 2; = @i1€; + aigeg + +--+ @ipex. Since the e;’s are linearly independent, 
it follows that for each fixed j, the numbers a1;,a2;,...,@2n41,7 satisfy the 
same properties as the x;’s. Since these numbers are rational, it follows that 
they are equal by the second step. But since this holds for every j, we have 
By 90 = Lo_21- ‘| 


Binary matrices are very useful when dealing with iterations of some pro- 
cess on a combinatorial structure. The fact that 2 kills M,(IF2) implies that 
for any commuting matrices A,B € M,(F2) and for any k > 1 we have 
(A+ B)™ = A + B®. This is very useful in computations. 


v 6. The edges of a regular 2”-gon are colored red and blue. A step consists 
in recoloring each edge whose neighbors have the same color in red and 
recoloring each edge whose neighbors have different colors in blue. Prove 


| 
\ 
| 
! 
} 
| 
1 
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that after 2*~! steps all of the edges will be red and that this need not 
hold after fewer steps. 


Tranian Olympiad 1998 


Proof. The easy part is the second question: if there is exactly one blue edge, 
then clearly after k < 2"! steps the &-th edge after this blue edge is blue, so 
it is impossible to make all edges red after less than 2”~! steps. 

Let us now prove that after 2"~! steps all edges are red. Let X; be the 
vector giving the state of the edges after the j-th step, so X; is the column 
vector with 2” coordinates, the k-th coordinate being 1 if the k-th edge is 
blue after 7 steps and 0 otherwise. By definition we have Xj; = AXj, where 
everything is taken modulo 2, A= 8+B7! and By = 1ifj +1 (ined 2") 
and 0 otherwise. Since B and B~' commute, the binomial formula and the fact 
that Pe) is even for all 1 < k < 2"7!—1 show that A2"™ = B?° 4 Br?"7 
(mod 2). However, it is very easy to compute the successive powers of B: 
a trivial induction shows that BE is simply the matrix whose entry is 1 if 
k (mod 2”) and © otherwise. In particular, B and so by the 
previous formula 4?""' = 0 (mod 2). But since X; j= AP Xo, it is clear that 
after 2°) steps all edges are red. 


The following problem is a classical old result, which has a very beautifu 
linear-algebraic proof. 


¥7. The map s: R" > R° is defined by 
8(@1,a9,...,@r) = (lai — al, lag ~ agl,..-s {ar — a1))- 
Prove the equivalence of the following statements: 


i) for all nonnegative integers aj, a2,-..,@,, there exists n such that 
the n-th iterate of s evaluated at (a1, a2,...,@,) is (0,0,...,0); 


ii) r is a power of 2. 


Ducci’s problem 
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Proof. Suppose first that r is a power of 2. Observe that the maximal coordi- 
nate of $(@1,@2,...,@,) is clearly smaller than or equal to the maximal coor- 
dinate of (a1,a2,...,a,). Start with some nonnegative integers (a1, a2,..-, Gy) 
and define Xy = (ay,42,...,@-) and Xni, = 8(X,). By the previous re- 
mark, all X,’s live in the hypercube [0,maxa,|". We will prove that the 
X,’s become arbitrarily divisible by 2, thus one of them will have to be equal 
to 6. To prove this, it is enough to prove that one of the X,’s has all co- 
ordinates even numbers, since then we can repeat the procedure to make 
the coordinates of another X,, multiples of 4 and so on. Now, note that 
if X_, = ie) ag(n),-..,4p(n)) then Xpai = (1 + T)(X,) (mod 2), where 
T (21, 02,.--,27} = (a. 3,-..,%1). Thus X, = (1+7)"Xo (med 2). Now, 
since () is even for all 1 << k <r — 1 (because r is a power of 2), we have 


(1+T)'Xo= S (,) PX = Xo+T"Xo (mod 2). 
kath 
But trivially T” = 1, the identity map, so Xp + 77 Xo = 2X9 = 0 Gnod 2). 
Thus all coordinates of X, are even and we are done. 

Assume now that i) holds. We deduce that for any vector # € F there 
exists 2 such that (1+ T)"« = 0. Indeed, any such x is the reduction mod 2 
of an r-tuple of nonnegative integers (@;,@2,-..,@,) and we know that we can 
find n such that s"((a1,09,...,a,)) = (0,...,0). But we also saw that 


8” ((a4,a9,...,4r)) = (1+ T)"(r1,¢2,...,2,) (mod 2). 


Now, since there are finitely many x € Ff, it follows that there exists n > 1 
such that (1+ 7')"x = 0 for all x € F5 (simply choose nz for each x and put 
n= maxn,). We claim that this forces r to be a power of 2. Notice that the 
minimal polynomial of T° as an endomorphism of F) is X’ +1 (T is simply the 
shift, so we can easily compute P(T) for any polynomial P). Thus, we must 
have X* +1|(X +1)” in F)[X]. In particular, we must have X? +1 = (X +1)? 
for some j <n and so (7) is even for all 1 < u <7 ~1, forcing j to be a power 
of 2. But then (1 +X)? = 14 X? and so r = j is also a power of 2. Oo 


Before discussing the next problems, we need some preliminaries. First, 
an easy but fundamental remark: suppose that f € Z[X1, X9,..., Xp] satisfies 


| 
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f(an,x2,...,2,) = 0 for all integers z;. Then an easy induction on n shows 
that all coefficients of f are zero. Thus, for all fields K and all x1, BQ,...,0n, € 
K we have f(a ;,22,...,t)) = 0 € K. Let us apply this observation to 
establish the following 


Theorem 12.2. Let K be a field and let A € M,(K) be such that ai +a; = 0 
for alli,j. Ifn is odd and if ay = 0 for alli, then det(A) = 0. 


Proof. Consider the polynomial 
| 0 az aig +++ Gin 


i-@iz 0 agg ++ Gan 
£(@12,013,---,dmn—1)=|7U3 423 0+ Asn} Zlarg,a13,..., dnn—tl- 


—@in Gan ~dan +> 0] 


By the previous discussion, it suffices to prove that f vanishes when eval- 
uated at any a;; € Z. But if X € M,,(Z) is antisymmetric, then 


det(X) = det(X‘) = det(—X) = (—1)” det(X) = ~ det(X), 


the first equality being standard, the second one the hypothesis and the last 
one a consequence of the fact that n is odd. Thus det(X) = 0 for all an- 
tisymmetric matrices X € M,(Z). But this is precisely what we wanted to 
prove. 


We consider next two applications of the previous theorem. 


W8. Suppose that n teams compete in a tournament wherein each team plays 
against any other team exactly once. In each game, 2 points are given 
to the winner, 1 point for a draw, and 0 points for the loser. It is known 
that for any subset. $ of teams, one can find a team (possibly in $ ) whose 
total score in the games with teams in S is odd. Prove that n is even. 


D, Karpov, Russian Olympiad 1972 
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Proof. Define the matrix A = (aij) by aj = 1 if there is a draw between #, 7 
and 0 otherwise. Clearly, ¢4 == 0 and aj; ji for all i,j. See A as a matrix 
with coefficients in Fz. We claim that A is invertible. Indeed, otherwise we 
can find a € F} nonzero such that Ar = 0. If J = {1 <i < nla; = 1}, then 
the condition Ar = 0 can be written }),<;aij = 0 for all 1 < i <n. Since 
each match which is not a draw yields an even number of points, the above 
condition implies that for all 7, the total score in the games between 7 and 
the teams in J is even, contradicting the hypothesis. Now, it is enough to use 
theorem 12.2 to conclude that 7 is even. O 


The following problem is essentially the same as the previous one, but 
written in different language. 


q 9. A simple graph has the property: given any nonempty set H of its 
vertices, there is a vertex z of the graph such that the number of edges 
connecting z with the points in H is odd. Prove that the graph has an 
even number of vertices. 


Proof. If A is the adjacency matrix of the graph, scen as matrix over Fy, 
then the hypothesis of the problem translates into: for all nonzero « € F} we 
have Az # 0. Thus A is invertible over Fg. Since A is clearly antisymmetric 
(because it is symmetric and we are in characteristic two) with zero diagonal, 
the result follows from theorem 12.2. Ci 


The following problem is rather easy. 


10. Let Ay, Ag,...,An be subsets of A = {1,2,...,n} such that for any 
nonempty subset T of A, there is i € A such that |A; T| is odd. 
Suppose that B,, By are subsets of A such that |A;9.By| = |A;M Bol = 
for all ¢. Prove that By = Bo. 


Gabriel Dospinescu, Mathematical Reflections 
Proof. Let’s consider the subsets of A as vectors in F}, where the entry in the 
ith position is 1 if the subset contains the element, 0 otherwise. Leti the vectors 
corresponding to A;’s form the rows of the matrix M. The first condition can 
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be expressed as Mu #4 0 for all v # 0. Let v be the vector all of whose 
components are 1. The second condition says that Mb; = v for all j, where 
b; are the vec ‘epresenting the B;’s. So we have M(b, — b2) = 0, whence 
by = be and By = Bo. Oo 


12.3 Applications of bilinear algebra 


In problems concerning incidences, it is sometimes useful to consider 
properties of inner products and bilinear algebra. If K is any field, we 
oe the inner product of two vectors x = (a1,79,-...dn) € K” and 

= (Y1.Y2,-.-,Un) € A” as 


(t,y) = xiyi + coy to + Pntn- 


If K =R, we let |[z|| = \/%z, 2) and call it the norm of x. The Cauchy-Schwarz 
inequality is equivalent to the triangle inequality ||x + y|] < {{al] + [lyjl. 

Let us start with a rather standard problem, which is nearly optimal. As 
the remark following the proof of it shows, this problem is actually rather 
subtle. We will see some variants of it later in the chapter. 


11. A*handbook classifies plants by 100 attributes (each plant either has a 
given attribute or does not have it). Two plants are dissimilar if they 
differ in at least 51 attributes. Show that the handbook cannot give 51 
plants all dissimilar from each other. 


Tournament of the Towns 1993 


Proof. More generally, assume that the handbook considers 2n attributes, that 
two plants are dissimilar if they differ in at least n + 1 of these, and that we 
have n+1 plants all dissimilar from each other. For each 7, let v; be the vector 
whose j-th coordinate is 1 if plant i has attribute 7 and —1 otherwise. Let (,) 
be the standard inner product and observe that the hypothesis implies that 
(uz, tj) < 0 for all i # 7. It is however clear that (v;,0;) is an even integer, 
thus we actually have (v%;,0;) < ~2 for all i 4 7. But then 


nti oe nth n+l 
oa = = Daten +a fe vj) S In(n +1) — 4( ) =o. 


wy 
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This implies that vj + vg+:+-+0n41 = 0. If n is odd, then we are quite stuc 
but if n is even, then last equality certainly cannot hold, as each component 
of vy + U2 4+°++>+ 41 is an odd number. Fortunately, n = 50 is even in our 
problem and so we are done. 


Proof. Let us assume that the book has 51 pairwise dissimilar plants, call them 
@1,42,.-.,4@5; and call the attributes P,, P2,...,Pioo. We will count triples 
(a:,4;,P,) with i < j so that @; and a; differ on attribute P;,. On the one 
hand, for each (a;,a;) there are at least 51 triples containing this pair, so the 
total number of triples is at least 51 Gh: On the other hand, if A, is the set 
of plants having attribute P;,, then there are |A,|(51 — [Ag|) < 25 - 26 pairs 
(a;,a;) living in triples with attribute P,. We deduce that 


51 


100 - 25 - 26 > S1- ea <=> 2600 > 2601, 


a contradiction. Oo 


Remark 12.3. A Hadamard matrix is a square matrix with entries +1 such 
that the rows are pairwise orthogonal vectors. It is a famous conjecture that 
for all n € 4N there exists an n x n Hadamard matrix. Assume that n+ 1 
is even and suppose that the previous conjecture holds. Thus we can fin 
a Hadamard matrix of order 2n +2. By multiplying some rows by ~1, we 
may assume that all entries of the first column are 1. Since the cohimns are 
orthogonal, each column except the first now has n + 1 positive and m +1 
negative terms. Now consider all the rows in which the first two terms are 
+1. There are 7+ 1 such rows. Take these rows and chop away the first two 
columns. The result is m+ 1 vectors of length 2n such that the inner product 
of every pair of vectors is —2. Interpreting the rows as plants and the columns 
as attributes, we get the required set of n+ 1 plants which pairwise differ 
in strictly more than half of their 2n attributes (in fact, every pair differs in 
exactly n + 1 attributes). One can actually exhibit Hadamard matrices of 
order 104, yielding 52 plants which pairwise differ in 52 out of 102 attributes. 


The next two problems are variations on the following nice and rather 
classical result. We will also present purely combinatorial proofs and we will 
leave the reader to decide which method is more efficient. 
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X] Theorem 12.4. Let v1, v2,...,ug be nonzero vectors in R® such that 
(vi, 03) <0 


for alli # j. Then k < 2n. Yr Ga de But Nat 


Proof. We will induct on n. For n = 1, it is an obvious application of the pi- 
geonhole principle, Now, suppose that the result holds for n -- 1 and consider 
1,02)... Up # O in R” such that (vj,v;) < 0 for all i 4 j. Dividing each v; by 
its norm and working with the new set of vectors, we may assume that 2, have 
norm 1. Choose an isometry A sending v; to the vector e,, whose first coordi- 
nate is 1 and all the others are 0. Working with the vectors Avy, Av2,..., Avg 
instead of 11, v2,..., vm (they satisfy the same hypothesis, as an isometry is in- 
vertible and preserves the inner product), we may assume that A is the identity 
and so v; = €;. Then the first coordinate of each of the vectors U2, U3, .665Uk 
is non-positive by hypothesis. Consider the vectors w2,w3,...,wy € R®-} 
obtained by deleting the first coordinate of v2, vs,..., vg. They are the pro- 
jections of vg, v3,..., 0% on the orthogonal complement to e,. Since the first 
coordinate of each of v2, 03,..., v4 is non-positive and since (v;, 05) < 0 for 
25147 < k, we also have (w;,w;) < 0 for all i # j. If wz, wa,..., wy are 


that some w,’s are zero. We claim that at most one of them is zero, If we 
manage to prove this, it will follow that k — 2 < 2(n ~— 1) and the inductive 
step will be proved. So, assume that we = wz = 0, then v1, v2, v3 are multiples 
of e; and since they are norm 1, each of them is equal to e; or —e;. But then 
we can find 1 <i < 7 < 3 such that v; = vj. As (vj, 0;) <0, it follows that 
vu, = 0, a contradiction. The result follows. 


Y 12. An m x n matrix is filed with 0 and 1 such that any two rows differ in 
at least n/2 positions. Prove that m < 2n. 


Sle ttre Cw? * Iranian Olympiad 
Proof. It is more convenient to replace all zeros in the table by ~1. Of course, 
this does not change anything to the hypothesis or conclusion of the problem. 
See each row as a vector in R” and let v1, 02,...,Um be these vectors, For any 
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i # j, the inner product (2, v;) is simply the difference between the number of 
positions where v; and v; coincide and the number of positions where the two 
vectors differ. Taking into account the hypothesis, we deduce that (v;,vj;) < 0 
for all i A j. Of course, v; 4 0 for all i. The result follows from theorem 
12.4. 


Proof. The crucial ingredient in the proof is the following 


Lemma 12.5. If in anm x n binary matrix every two rows differ in at least 


d positions and if 2d > n, then m < 534. 


Proof. Call a pair (x,y) of matrix entries good if z and y are in the same 
column and if (z,y) = (0,1). If d; is the number of ones in the ¢th column, 
then clearly there are S77_, di(m — dj) good pairs. Since d;(m — dj) < a 
there are at most nme good pairs. However, if we fix two rows, we find at 
least d good pairs, by hypothesis. Thus we find at least d(}) good pairs and 
sod) <n- me which immediately yields the lemma. 


Passing now to the proof, let A; be the number of rows whose first element 
is 1 and let Ag be the number of rows whose first element is 0. If we consider 
only the rows whose first element is 1 and then delete the first element in 
each such row, we obtain a set of rows such that any two differ in at least 
positions. By the previous lemma, we deduce that A, <m and Ag <n. Since 
m= A; + Ag, the result follows. oO 


y 13. Let 2 be a positive integer. If u = (u:)#", and v = (v;)2", are binary 
sequences of length 2”, let the distance between them be 


on 
d(u,v) = So hui — vj. 
i=1 


Find the greatest number of binary sequences of length 2” whose pairwise 
distances are at least 2"—!. 


China TST 2005 
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Proof. Replacing each coordinate 0 by —1 in each vector, we must find the 
maximal number of vectors of length 2” with coordinates +1 and whose pair- 
wise inner products are nonpositive. Theorem 12.4 shows that this maximal 
number is at most 2"*1. The problem is thus solved if we can exhibit 2°+! vec- 
tors with coordinates +1 in R?” and with nonpositive inner products. We will 
consider the vectors -kez, be2,...,-kegn, where ej, €2,-..,€9r is an orthogonal 
basis of R?" and all e; have coordinates +1. Such a basis can be constructed 
by induction: for n = 1 choose e, = (1,1) and e; = (1,--1). Assuming that 
we have such a basis for m, consider the vectors fj = (e;,¢;) for 1 < i < 2" 
(where (,y) is the vector obtained by considering the coordinates of x fol- 
lowed by the coordinates of y) and fj = (—ej—9n, e..9n) for 2741 <i < 2", 
We can immediately check that these vectors are linearly independent, and by 
construction they are orthogonal. The inductive step is proved. Ci 


Remark 12.6. A more general question is the following: given n and k, what is 
the maximal number of binary sequences of length n such that any two differ 
in at least & places? For some nontrivial cases of this problem, we refer the 
reader to chapter 10 of {7]. 


Here is another very classical and nontrivial result. There are purely 
combinatorial proofs, but they are really not very illuminating. On the other 
hand, the algebraic proof is very neat. 


14. Let Ay, Ag,..., Am be distinct subsets of a set A with n > 2 elements. 
Suppose that any two of these subsets have exactly one common element. 
Prove that m <n. 


Fisher's inequality 


Proof. We may assume that A= {1,2,...,n}. Following the usual technique 
for this kind of problem, consider the incidence matrix M of the sets, that is 
the m xn matrix defined by m3 = 1if 7 € Aj and 0 otherwise. By hypothesis, 
M- Mt‘ is the matrix with é-th row (1,1,...,|Ad,1,--.,1). 

For technical reasons, we have to di two cases. The first case is when 
|A;} = 1 for some i. But then by assumption all A; have to contain A; and so 
Aj \ Ay for j # 7 are pairwise disjoint nonempty subsets of a set with n~ 1 
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elements, thus certainly m ~ 1 < n- 1 and we are done. The hard case is 
when all |Aj] > 2 and the key point (to be proved below) is that in this case 
M-M‘ is invertible, thus of rank m. Since the rank of Af-M* is at most that 
of M, which is at. most », this will be enough to prove the result. 

In order to prove the key point, we will argue by contradiction: if 
M - M? ig not invertible, there exists a nonzero vector with real coordinates 
1, £2,...,2m such that M+. Mtx = 0. But then ‘rAf-Mta = 0 and so 


sF |Aj le? + 2 1 tay = 0. 


mh isi<jgm 


This can be also written as 


Sal — net +(x) 


Since |A;| > 2 for all 7, the last relation obviously implies that all 2; are zero, 
a contradiction. Thus, the claim is proved and the problem is solved. O 


Remark 12.7. One can also prove the invertibility of A -Af* in a different way, 
if [Aj] > 2 for all i. Namely, if Af. Méx = 0, then ({Aj| ~ la; + S = 0, where 
S= 414+ 42 +---+2,. But then cj = — as and so 


which obviously implies that S = 0 and so all x; are zero. 
Remark 12.8. We leave as an exercise to the reader to adapt the previous proof 
and deduce the following more general result: let A;, Ag,..., Am be different 
subsets of a set with n elements X. If there is @ € {1,2,...,m — 1} such that 
|A, 9 Ajj = a for all i # j, then m <n. 

It is also very hard to solve the following problem without linear algebra. 
On the other hand, even with the help of (bi)linear algebra, this problem is 
quite tricky. 
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15. Let Aj, A2,...,Am and By, By,..., By be families of distinct subsets of 


Benny Sudakov 


Proof. Let vj be the incidence vector of A; and let w; be the incidence vector 
of B,, seen as vectors in F§. The hypothesis becomes: (u;, w;) = 1 for all i,j, 
where (,) is the standard inner product on Fy. Let V be the vector space 
spanned by the vectors w;. Since the vectors v; + v, are m distinct vectors 
each orthogonal to every w;, thus to V, we have 


m< gdim Span(v;+03) < gn—dim Span(w;) 
Moreover, since (v;,wj) = 1 and (v1, wz + wy) = 0 for all j,k, the sets 
{w;]l <i <p} and {w+ wifl <i <p} 


are disjoint subsets of Span(w;). Thus 2p < 2%Span(wi)_ Multiplying these 
two inequalities yields the desired result. a) 


The next three problems also require a preliminary discussion. Namely, 
we will recall the proof of the following classical, but nontrivial result: 


Theorem 12.9, Let A € M,(F2) be a symmetric matrix and let d be the 
column vector whose coordinates are the entries on the main diagonal of A. 
Then there exists « € F} such that Ax = d. 


The easiest proof uses bilinear algebra: consider the standard inner prod- 
uct on F} and observe that it suffices to prove that any vector which is or- 
thogonal to Im(4A) is also orthogonal to d, But if v is such a vector, we have 


n 


n 
Ss SS aijw; v= 0 
jah 


1 


for all w; ¢ Fy and by exchanging the two sums we deduce that 


n 
) avi = 0 
i=] 


{ 
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for all 7. But then we also have 


n n 
Soy ( osm) =0, 


j=l i=l 


which can be written (taking into account that aj; + aj; = 0) 


ave = 0, 


i 
As u? = v;, we are done. 


16. Light bulbs £1, £2,.-..,£, are controlled by switches S1,S9,....S,. 
Switch S; changes the on/off status of light L, and possibly the sta- 
tus of some other lights. Suppose that if 5; changes the status of L; 
then S$; changes the status of L;. Initially all lights are off. Is it possible 
to operate the switches in such a way that all the lights are on? 


Uri Peled, AMM 10197 


Proof. This is a very easy application of the previous result: define aj; = 1 if 
S; changes the status of L, and 0 otherwise. By hypothesis, we have aij = a4 
and ai = 1, Thus the matrix A = (a;;), seen as matrix over Fp, is symmetric 
with diagonal elements 1. By the previous theorem, we can find a nonzero 
vector x € F} such that Ax is the diagonal vector of A. Thus, 377, aijaj = 1 
for all 7. If J is the set of those j with a; = 1, the previous equality tells us 
that if we operate the switches S; with j ¢ J, then the state of each L; will 
change an odd number of times and so all lights £; will be on. a] 


The following problem is a bit trickier. We also give a purely combinatorial 
proof, to see the difference... 


17. Let G be a graph. Prove that the set of its vertices can be partitioned in 
two groups (possibly empty) such that each group induces a subgraph 
in which all vertices have even degree. 


Gallai’s Cycle-Cocycle partition theorem 
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Proof. Consider the adjacency matrix A of the graph and perform the follow- 
ing operations on it: for each vertex é of odd degree, add a 1 in position (é, 7) 
of A, For all the other vertices i, leave position (i,7) as in the original matrix 
(ie. equal to 0). We thus get a new symmetric binary matrix B = (bj). By 
the previous theorem, there exists « € Fj such that 


for all i, Let Vj = {1 <i < nla; = 1} and let VQ be its complement. We claim 
that this partition of the vertices satisfies the desired conditions. Indeed, for 
i € Vy we have Lijevi-{a bi; = 0, so ¢ has an even degree in the subgraph 
induced by Vj. For i € V2, we have 


7EVe—-{i} 3 


n n 


big — S by = So bis 


1 jeViv{i} j=l 


and the last quantity is 0 by construction. Thus i has even degree in V2 and 
we are done. a 


We continue with a nice variation on the previous result. 


18, At a certain mathematical conference, every pair of mathematicians are 
either friends or strangers. At mealtime, every participant eats in one 
of two large dining rooms. Each mathematician insists upon eating in a 
room which contains an even number of his or her friends. Prove that 
the number of ways that the mathematicians may be split between the 
two rooms is a power of two . 


USAMO 2008 


Proof. Note that this problem implies the existence of at least one admissible 
partition (i.e. one that satisfies the conditions of the problem), which is pre- 
cisely the content of the previous problem. Fortunately, the hard work was 
actually done. Let G be the graph whose vertices are the mathematicians, 
two vertices being connected if the mathematicians are friends. Suppose that 
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G has n vertices 1,2,...,n and consider the matrix B as in the solution of 
the previous problem. We claim that there is a natural bijection between ad- 
missible partitions and solutions of the system Bx = d, where d is the main 
diagonal of B. Indeed, let us identify admissible partitions Co,C, with vec- 
tors 2 € F§, where x; = 0 if i € Co and x; = Lif i € Cy. The admissibility 
condition can be written: for all 7 we have 


1=0 (mod 2), 
by = lL jFijEC (i) 


where C(%) is the class of the partition containing 7. Just as in the proof of 
the previous problem, this condition can also be written 


n 
ys byxy = bi. 
j=l 


Thus, the number of admissible partitions is the number of vectors x € F} 
such that Ba = d. We know that there is at least one such vector ¢q (by the 
discussion preceding problem 10 or by the previous problem). But then 


{2|Ax = d} = ap + {z|Axr = 0} 


and {x|Ax = 0} = Ker(A) is a linear subspace of F}, so it has cardinality 
gdim Ker(A)_ Since this is a power of 2, the conclusion follows. 


12.4 Matrix equations 


To fully appreciate the power of linear algebra in combinatorics, the reader 
should try to find a purely combinatorial solution te the following problem, 


f 19. Let Gi,G2,...,G_ be complete bipartite subgraphs of the complete 


graph Kg, with 2n vertices. Assume that every edge of Ken is contained 
in an odd number of subgraphs G1, G2,...,Gz. Prove that k > n. 


Proof. Let 1,2,...,2n be the vertices of Koy, and let L;, R; be disjoint subsets 
of {1,2,...,2n} such that the set of vertices of G; is L; UR; and two vertices 
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are connected if and only if one is in L; and the other is in Rj. If l; € F3" is the 
vector whose nonzero coordinates are precisely those on positions belonging to 
L, and if r; is similarly defined, then the entry in position (u,v) of the matrix 
1, rf is precisely uct, ver, We deduce that the entry in position (wu, v) of 


k 
X= SG ‘ rf ari tt) € M2, (IF2) 
i=l] 


is 0 if u = v and it is the number of subgraphs containing the edge uv, 
taken mod 2, otherwise, Therefore, the hypothesis can be reformulated as an 
equality of matrices X = J — Jon, where J is the matrix all of whose entries 
are 1. Now, matrices of the form 1 - v5 are precisely the matrices of rank at 
most 1. So, we expressed X as a sum of 2k matrices of rank at most 1. Since 
the rank function is sub-additive (thought of in terms of endomorphisms, this 
is obvious: the image of u + v is contained in the sum of the images of u and 
v for any endomorphisms w, v of a vector space), in order to solve the problem 
it is enough to prove that J — fy, has rank 2n, i.e. that it is invertible. This is 
however trivial: if Jr = x for some vector z, then 2; = 21 +22+++++ Zon, for 
all 2, so first all a;’s are equal and then 2; = 2nz; = 0, since we are working 
in chatacteristic 2. The result follows. 


We also discussed the following beautiful problem in example 7, chapter 
23 of [3]. We give here a very natural solution using the following classical 
result: 


Theorem 12.10, Let X € M,(R) and Y € M,,(R). There exists a nonzero 
matrizt A € Maxm(R) such that XA = AY if and only if X,Y have a common 
eigenvalue. 


Proof. Assume first that there exists a nonzero matrix A € Myxm(R) such 
that XA = AY, but X,Y have no common eigenvalue. Thus the characteristic 
polynomials P.Q of X,Y are relatively prime and so there exist R, 9 € C{T] 
such that PR + QS = 1. Now, since XA = AY, we have by the Cayley- 
Hamilton theorem 0 = P(X)A = AP(Y). But we also have AQ(Y) = 0, as 
Q(Y) = 0 (again by Cayley-Hamilton). Thus 


A= A(PR)Y) + A(QS)Y) = AP(Y)R(Y) + AQ(Y)S(Y) = 0, 
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a contradiction. This settles the first part. Next, if X,Y have a common 
eigenvalue A, then X and Y* also have the common eigenvalue \ and so one 
can find nonzero vectors v, w such that Xv = dv, ¥'w = Aw. It is then easy to 
check that A = vw! is a nonzero complex matrix such that XA = AY. Then 
one of Re(A) and Im(A) is a nonzero real matrix satisfying again XA = AY 
and we are done. 


20. A grid divides an n xm sheet of paper into unit squares. The two sides of 
length nm are taped together to form a cylinder. Prove that it is possible 
to write a real number in each square, not all zero, so that each number 
is the sum of the numbers in the neighboring squares, if and only if there 
exist integers &,/ such that n+ 1 does not divide k and 


kr 1 


n+l 2 


2a 
COs ——~ + COS 


Ciprian Manolescu, Romanian TST 1998 


Proof. See the grid as an n x m matrix A and define ao; = an41,4 = 0 for all 
i. The problem becomes: there exists a nonzero matrix A such that 


Gig FF Ani HF Gipayy + Gij-1 + Gigi 


(for all 1 <i <n, 7 being taken mod m) if and only if there exist integers 
k,l as in the statement. The previous equality can be written very simply 
by introducing the matrices B € M,(Z), C € Mm{Z) defined by 8; = 1 if 


Indeed, the previous relation is equivalent to AC+BA = A, or BA = A([—C). 


By theorem 12.10, the existence of the matrix A is equivalent to the existence 


of a common eigenvalue of B and J ~ C. This is equivalent to the existence of 
eigenvalues Aj, Az of B, respectively C such that Ay-+A2 = 1, Let us compute 
the eigenvalues of C. If Cz = Aa for a nonzero vector x, we can write 


fg¢ 2m =A, Utarg=ATq, «2.5 Bt om-1 = AITm 
and so if we extend 2; by m-periodicity, we can simply write this as 


Tipe $ Li = ATi. 
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If r1,r2 are the roots of the equation ¢? ~ At +1 = 0, then a; = Cyri + Cori, 
(Cyrj + Cir] if ry = ra). Imposing the condition of m-periodicity, we see that 
we should have rj" = rf’ = 1 and (because r; and rg are reciprocal roots of 
unity) 
a Ql 
A= ry tre =7r1+7) = 2Re(ry) = 2.008 
for some 0 <1 < m. This suggests the form of the eigenvalues of C. To make 
this argument completely rigorous, it suffices to reverse the computations: 
simply choose any 0 <1 < m, set 4 = 2cos #2, ry = e* and choose 2; = ri. 
Then 2 is an eigenvector with eigenvalue A. Since we have found the correct 
number of eigenvectors, we are done: the eigenvalues of C are 2cos on (the 
argument was a bit long, but we wanted to present it in this form as it is 
rather useful). The same kind of reasoning shows that the eigenvalues of B 
are 2.cos peat and the result follows from these computations and the previous 
discussion. 


We give two proofs for the following beautiful and classical result. For 
the first proof we follow [29], while the second proof appears in [42}. 


¥ 21, In a society, acquaintance is mutual and any two persons have exactly 
one common friend. Then there is a person who knows all the others. 


The friendship theorem, Erdés, Rényi, Sdés 


Proof. First, we will prove that either there is a universal friend (i.e. a person 
knowing all the others) or any two persons have the same number of friends. 
if A and B are not friends, let @),a2,..., a be A’s friends and let bj, ba,...,b; 
be B’s friends. By assumption, a; and B have a unique common friend bs.) 
and b;, A have a unique common friend ag(;). Since f and g are clearly inverses 
to each other we have k = / and. we are done. Consider the obvious graph of 
enemy relationships in this society and suppose that it is disconnected. Then 
the society can be partitioned into two sets X and Y such that everyone in set 
X is friends with everyone in set Y. If both X and Y have two people or more, 
then two people in X have at least two common friends in Y, contradiction. 
On the other hand, if either set contains only one person then that person is 
a universal friend. Thus, if the enemy graph is disconnected, there must be 
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a universal friend, On the other hand, if the enemy graph is connected, then 
every person has the same number of friends, because we saw that two enemies 
have the same number of friends. This proves the claim and also shows that 
we may assume that the enemy graph is connected in what follows. 

Let d be the number of friends of each person and let n be the number 
of people. Then, the number of pairs of people with a specific person as their 
comumon friend is , so by counting the number of pairs of people in two 
ways we get (3) = n{§) and so n = d? ~d+1. Finally, let A be the adjacency 
matrix of the graph. By our previous work, we can write A = (d—-1)I + J, 
where J is the matrix having 1 at each entry. Trivially, (d - 1)J + J has 
eigenvalues d* and d -- 1 (with multiplicities 1 and n — 1 respectively). Thus 
the eigenvalues of A are d and +/d—1. Since the trace of A is zero, we 
deduce that d+ (a — b)/d—1 = 0, where a,b are the multiplicities of the 
eigenvalues +./d - 1. Squaring the last relation, we deduce that d-- 1 divides 
d’, so that d = 1 or d= 2. But both such cases trivially satisfy the theorem, 
which gives the contradiction we needed. 


Proof. As in the previous solution, we prove that either there is a universal 
friend or the associated graph G is d-regular and connected, for some d such 
that n = d* —-d+1. Now comes the beautiful idea: for a positive integer p, 
let c(p) be the number of cycles of people of length p such that each person in 
the cycle is friends with the next person (people can appear multiple times in 
such a cycle). Any such cycle can be constructed in the following way: pick a 
starting person, pick one of his friends, pick a friend of that friend, etc., until 
you are at the second to last person in the cycle. If the second to last person 
is different from the first person, then the last person must be their common 
friend, but if the second to last person is the same as the first then the last 
person can be any of the first person’s d friends. Thus 


e(p) = nd“? + (d — 1)e(p — 2). 


ifd #0 and d # 2, pick a prime divisor p of d—1. Then, since n = d?—d+1, 
we have n = 1 (mod p) and so c(p) = 1 (mod p). On the other hand, c(p) 
is a multiple of p: there are no cycles that are shifts of themselves (since no . 
person is a friend of himself or herself... ), so the number of cycles of length 
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p must be a multiple of p. The last two results are not possible at the same 
time, so that we must have d = 0 or d = 2. But in this case it is clear that 
there is a universal friend, which finishes the proof. oO 


Remark 12.11. For infinitely many persons, this result does not hold anymore. 


Remark 12.12. The theorem actually classifies all graphs as in the statement: 
all of them consist of a collection of triangles sharing a vertex. 


Remark 12.13. In {42} one can also find a combination of the first two solutions, 
which is very elegant. Namely, first we prove that the graph is d-regular for 
some d such that n = d? —d+1 and that the adjacency matrix A of the graph 
satisfies A? = (d -1)] + J, where J is the matrix all of whose entries are 1. 
Suppose that ¢ 2 3 and pick a prime p dividing d— 1. Working in M,(F,), 
we obtain A? = J and since AJ = d.J, we have AJ = J. Then AP = J and 
so tr(A?) = n (mod p). On the other hand, an extension of Fermat’s little 
‘theorem yields tr( A?) = (trA)? = trA (mod p) (working with the eigenvalues 
of A this reduces easily to theorem 9.15). But clearly trA = 0. We deduce 
that p divides n, which is clearly impossible. Thus d < 2. 

We end this section with another classical result in algebraic combina- 
torics. 


22. In a graph G with n? + 1 vertices suppose that every vertex has degree 
n and every cycle has length at least 5. Then n € {1,2,3,7,57}. 


Hoffman-Singleton’s theorem 


Proof. The combinatorial part of the theorem is contained in 


Lemma 12.14, If x,y are vertices of G, the number of common neighbors of 
zy is 0 if x,y are connected and 1 otherwise. 


Proof. We will count triples (x,(y,z)) in which z,y,z are vertices and x 
is connected to both y and z. Counting them according to 2, we obtain 
(n? +1)(§) triples (since there are n? +-1 vertices, each of degree n). On the 
other hand, two connected vertices cannot belong to a triple (otherwise we 
would obtain a 3-cycle in G) and two non-connected vertices y,z can belong 
to at most one triple (otherwise G would have a 4-cycle). So the number 
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of triples is at most (ne 1) minus the number of edges of the graph, so at 
most: ery - (Prue = (n® +1)(3). Since we have already established that 
there are precisely (n? + 1)(5) triples, it follows that every two non-connected 
vertices belong to exactly one triple and the lemma is proved. 0 


Let A be the incidence matrix of G and let J be the matrix all of whose 
entries are 1. The previous lemma yields 


A =nl¢J-I-A=(n-l-AtJ. 


Next, A is a symmetric matrix with real entries and zero diagonal, so it has 
zero trace and it is diagonalizable in an orthonormal basis (by the spectral 
theorem). In particular, all eigenvalues of A are real and the eigenvectors 
corresponding to different eigenvalues are orthogonal. Since the sum of entries 
in each row is n (because G is n-regular), we have Av; = nv1, where v; is 
the vector all of whose coordinates are 1. So, if Av = Av for some v # 0 and 
A #n, then (vj,v) = 0 and so Jv = 0. But this implies that \? = n—1—-.. 
So, if r1,r2 are the roots of the equation 2? + ¢ = n— 1, any eigenvalue of A 
different from n is ry or rz. 

Next, we claim that n has multiplicity 1 as an eigenvalue of A. It suffices 
to prove that if Av = rv, then v is collinear to v,. This is easy, since Av = nv 
implies n?v = A?v = (n ~ 1)u ~ Av + Ju and so Ju = (n? + 1)u, hence all 
coordinates of v are equal and v is collinear to 1. Let @,6 be the multiplicities 
of r1, rg as eigenvalues of A. So a+b = n? by the result we have just established 
and ar, + br2 +2 = 0, as A has trace 0. Since 


~1+/4n-3 i -1-—V4n—-3 
2 : 2 : 


N= 


the previous relation becomes 


atb an~b, 
— 4 An ~ 3 5 tn = 0. 


a=bandn =a. Sincea+b = n?, this gives n = 2 and we are done. If 
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2 
k = \/4n —3 is rational, write a 4-5 =n? = (£#) and replace this in the 
previous relation. We obtain 


16k(a — b) ~ (k® +3)? + 8(K? + 3) = 0. 


Reducing this mod k, we finally obtain k/15 and since n = Bas the result 
follows. Ct 


Remark 12.15. The proof of the lemma actually yields the following result: 
if G is a graph in which any cycle has length at least 5 and any vertex has 
degree at least n, then G has at least n? + 1 vertices. So the theorem proved 
above actually classifies the extremal case. At the time of this writing, it is 
unknown whether or not there exists a graph as in the theorem for n = 57 
(for all the other values of n, such graphs exist). 


12.5 The linear independence trick 


23. In an m x n table, real numbers are written such that for any two rows 
and any two columns, the sum of the numbers situated in opposite cor- 
ners of the rectangle formed by them is equal io the sum of the numbers 
situated in the other two opposite corners. Some of the numbers are 
erased, but the remaining ones allow us to find the erased numbers us- 
ing the above property. Prove that at least n +m —1 numbers remained 
on the table. 


Russian Olympiad 1971 


Proof. Consider the set X of all m xn matrices with real entries such that for 
any two rows and columns, the sum of the numbers situated in the opposite 
corners of the rectangle formed by them is equal to the sum of the numbers 
situated in the other two opposite corners. This set is obviously a linear 
subspace of Mrxn(R). If $ is the set of pairs (i,7) such that the number 
situated at the entry (i,j) has not been erased, then the hypothesis can be 
translated into: the map f : X + RII obtained by sending a matrix A € X 
to the collection (a;,;)(; eg is injective. But then the dimension of X has to 
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be at most the dimension of R/S!. Thus, |S is at least equal to the dimension 
of X as R-vector space. Therefore it is enough to exhibit m +n — 1 linearly 
independent matrices in X. This is easy: simply consider the matrices A; 
having the j-th row equal to the vector (1,1,...,1) and the other rows zero 
(and this for all 1 < 7 < m), then the matrices B; having the i-th column 
consisting of ones and the remaining entries 0 (and this for all 1 <i <n~ 1) 


It is immediate to check that they are in X and linearly independent. Oo 


Once the necessary mathematical translations are done, the proof of the 
following problem is shorter than its statement. .. 


24. In a contest consisting of n problems, the jury defines the difficulty of 
each problem by assigning it a positive integral number of points.’ Any 
participant who answers the problem correctly receives that number of 
points for the problem; any other participant receives 0 points. After the 
participants submitted their answers in such a contest, the jury realized 
that given any ordering of the participants, it could have defined the 
problems’ difficulty levels to make that ordering coincide with the par- 
ticipants’ ranking according to their total scores.? Determine, in terms 
of n, the maximum number of participants for which such a scenario 
could occur. 


Russian Olympiad 2001 


Proof. lt is clear that we may have 7 participants: impose that participant j 
solves problem 7 and nothing else. We will prove that we cannot have more 
than n participants. By associating to participant j the vector xj; consisting 
of the number of points he received for each problem, the hypothesis implies 
that xj € RY and for every ordering of the a;’s, there is a vector y € RY. which 
gives that ordering of (a;, y). On the other hand, if the number of participants 
m. is greater than n, there is a nontrivial linear combination of the x;’s that 
vanishes. By separating the coefficients of such a combination according to 
their sign, we find an equality >, a;a; = }), 42; for some disjoint subsets 


'The same number of points may be assigned to different problems. 
2 Ties are not permitted. 
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I, 7 of indices and positive numbers aj, 8; such that So, 04 > yy 8. But it is 
clearly impossible to rank all x; above all x;. 


There is no known combinatorial proof of the following result. On the 
other hand, the algebraic proof is rather simple. 


25. Let m > n+ 1 and let A), Ag,...,Am be subsets of {1,2,...,n}. Then 
there are disjoint sets I, J such that U Aj = U Aj and ‘a Ay = ial Aj. 

ai ded ied ged 
Lindstrom's theorem 


Proof. Associate to each set Aj a vector vj; € R?”, say 
U; = (21, 42,...,n,1— 24,1 ~2Q,...,1 2p), 


where 2; = 1 if 7 € A; and 0 otherwise. We obtain at least n -+- 2 vectors that 
lie in the (n + 1)-dimensional vector subspace of R?” defined by the equations 
T+ =t2+yo=-++=Ip+Yn. Thus, these vectors are linearly dependent 
and we can find @),42,...,@m € R not all zero and such that Sy av; = 0. 
Define now I = {ila; > 0} and J = {ila; < 0}. Clearly I,J are disjoint 
and we prove that UA = U Aj and (4 = ‘al Ay. This is very easy: if 


ier jes i€E jer 
gE U A; but 7 ¢ U Aj, then by definition the jth component of all v; with 
ier jes 


i € J is 0, whereas there exists an i € I with the jth component of v; nonzero. 
But then, looking at the jth component in the equality 7, ajv; = 0 yields 
a contradiction. By symmetry, we deduce that UA = U Aj. Similarly, 


tet jel 
looking at the n+1,...,2nth coordinate in the equality 7%, aiv; = 0 we can 
show that (\Ai = ‘a Aj, which finishes the proof of the theorem. aa 


qel G65 
In order to motivate the following result, we need some preliminaries. 
Suppose that F is a collection of subsets of some set X. We say that F 


shatters a subset S of X if {AN S|A € F} contains all subsets of S. The 
Vapnik-Chervonenkis dimension of F (also denoted ve(F)) is the maximal 
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the collection of all affine half-spaces. Then ve(F') = n +1, though this is 
not obvious. Note that F shatters the set. of vertices of an n + 1-simplex, so 
ve(F) >n+1. The converse inequality follows from the following classical 
theorem of Radon: if S CR” is a set with more than n+ 1 elements, then one 
can find disjoint subsets A, B of S such that the convex hulls of A and B have 
common points. The following beautiful result shows that large collections 
shatter big sets. 


26. Let F be a family of subsets of {1,2,...,”} such that 
nr nr nm 
ne (Je()e-e() 


Then ve(F) > k. 
Sauer-Shelah’s lemma 


Proof. Let us assume that ve(F) < k — 1. We will prove that 


iis ()+(3) ++ () 


Let P;,_1 be the family of all subsets A of {1,2,...,} such that |A] < &-1. To 
any X € F associate the map fx : Px_; 4 R defined by fx(A) = Lacx (this 
is equal to 1 if A C X and to 0 otherwise). We will prove that the maps fy are 
linearly independent when X runs over the elements of F. This will clearly 
imply the desired bound on the size of F. Assume that )°y<¢p@xfx = 0 for 
sorne real numbers @x, not all equal to zero. Evaluating at elements A € Py_1, 
we deduce that for any such A we have Soy. PACK &x = 0. The key step is to 
choose a minimal subset A such that }> Xeracx ex #0. We will prove that 
for any Bc A we can find X € F such that B= AMX. This will contradict 
the property of F and the result will follow. It is of course enough to prove 
that Syerpexna@x #0. But by the inclusion-exclusion principle we have 


SM a= SCF YS ax 


XEPXNA=B BCCCA XEP CCX 
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However, by minimality of A all sums > XeF.Ccx @X are zero, except for C = 
A. Thus the right-hand side is nothing more than (—1)!4~8! KER ACK OX: 
which is nonzero by the choice of A. This finishes the proof. C 


Proof. We will prove by induction on {F]| the following statement: for any n 
and any family F of subsets of {1,2,...,n}, F shatters at least |F} subsets of 
{1,2,...,%}. This trivially implies the desired result. Now, the result clearly 
holds for |¥| = 1, so assume that it holds for all |F] < N and take a family 
F with N elemenis, all of them subsets of {1,2,...,2}. Let Fo consist of the 
elements of F not containing 1 (we assume, without loss of generality, that 1 
occurs in some, but not all elements of F), By induction, Fo shatters at least 
|Fo| subsets of {2,3,...,}. Also by induction, Fy = {X\{1}: X € F,1€ X} 
shatters at least [71] subsets of {2,3,...,n}. But if A C {2,3,...,n} is 
shattered by F; then A is also shattered by {X € Fil € X}, and if A is 
shattered by both Fo and F; then both A and AU {1} are shattered by F. 
We conclude that at least |[Fo|+|Fi| = |F| subsets of {1,2,...,n} are shattered 
by F, and the result follows. 


Remark 12.16. Here is a beautiful geometrical interpretation of the previous 
result: let S$ be a subset of {~1,1}" with more than 374, (") elements. Then 
there exists a subset IC {1,2,...,2} such that 


{(xiierla € S} = {1,1}. 


That is, a large subset of the vertices of the unit hypercube has a coordinate 
projection covering a unit hypercube in a smaller dimension! 


We present two proofs of the following classical theorem of Bollobas. One 
of the proofs needs some preliminaries on resultants. Let K be a field and let 
f = ag +a1,X +++ +apX? and g = bo + 1X +--+ +b,X% be two polynomials 
with coefficients in K. Let K[X], be the n+ 1-dimensional K-vector space of 
polynomials of degree at most n in K[X]. It is easy to see that gcd(f,g) = 
if and only if the map y : K[X]g-1 x K[X]p-1 > KX ]ptqei, o(U,V) = 
Uf + Vq is injective. Since the source and the target are K-vector spaces 
of the same finite dimension, it follows that ged(f,g) = 1 if and only if ¢ is 
an invertible linear map, ie. if and only if the matrix of y in the natural 
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bases of K{X]q.1 x K[X]p-1 and K[X]p4q-1 is invertible. This matrix has the 
first ¢ columns the coordinates of f, X f,...,X¢-'f with respect to the basis 
(1,.X,...,X?+9-1) and the last p columns the coordinates of g, Xg,...,X?~1g 
with respect to the same basis. The determinant of this matrix is called the 
resultant of f and g and is denoted Res(f,g). The previous discussion shows 
that Res(f,g) = 0 if and only if f and g have a common nontrivial divisor, 
which is the same as saying that they have a common root in some algebraic 
closure of Kk. 


27, Let a,6 be positive integers and let A), A2,...,Am and Bi, Bo,...,Bm 
be sets such that 


a) |Ai| = [Ag] = -++ = [Am| = @ and [By| = [Bal = --- = [Bal = 6 
b) A; B; is nonempty if and only if ¢ 4 j. 


Then m < (27°). 


Bollobds’ theorem 


Proof. We will only assume that A; B; 4 @ for i < 7. We may assume that 
Aj, By are subsets of {1,2,...,n} for some n. Choose 24, ¥2,.--,Un € Rt? 
such that any a+ 1 vectors among them are linearly independent (this is easily 
achievable). Let uj € R** be a vector orthogonal to all v, with u © Ay. Such 
a vector exists and is unique up to a scalar multiple as by construction the 
vectors (vx)ueA, span a hyperplane of R°*. 

Consider the polynomials 


fi(X) = TJ @.), 


EB; 


where {,) is the standard inner product and X = (Xq, X2,...,Xaii). These 
are homogeneous polynomials of degree 6 in a+ 1 variables, liv ing thus in a 
space of dimension (*}"). We claim that f,(v) = 0 for i <j and fi(vi) 40, 
which implies that the f; are linearly independent and so m < CH: To prove 
the claim, let i < j. Since A;MB; # 0, there is x € BjNA; and so (ee. +) = 0. 
Thus f;(up) = 0 fori < A Also, since A; MB; = @, we have (vz, up) 7 0 for 
all « € Bj, thus fi(vt) 4 Qo 
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Proof. Of course, we may assume that all Aj, B; are subsets of R. Consider 
the polynomials 


f(X)= [[X-a), 9x) = T] x -9). 


aca; bEB; 


By assumption, f; and 9; have a common root if and only if i 4 7. Thus, if 
R(f,g) is the resultant of two polynomials fig, we have R(fi,9;) = 0 if and 
only if 2 A 7. 
Now, write 
WX) = Go + NX Ho + qx? 


and observe that the expression 


A;(90,---+ 9) = RCfisg) 


is a homogeneous polynomial of degree a in the variables 90;--+,96- Moreover, 
the previous results show that Ai(gjo,--+.996) = 0 if and only if 14 j. Thus, 
the polynomials A; are linearly independent and homogeneous of degree a in 
+1 variables. Since the vector space of such polynomials has dimension 
CP); the result follows. 


The following problem is very challenging. We present the author’s proof. 
y : 


28. Let £1, #2,...,2, be distinct real numbers and suppose that the vector 
space spanned by x; — x; over the rationals has dimension m, Then the 
vector space spanned only by those 2; — 2; for which x; -— a; 4 xy ~ 2 
whenever (i,j) # (k,J) also has dimension m. 


Straus’s theorem 


Proof. By working with the numbers 2; — 21, we may assume that 2, = 0. 
Let V be the span of all differences 2; — 23,80 dimV = m. Say a difference 
x; ~ x; is unique if x; ~ x; = x, ~ 2 implies that i = k and j=l. Let V’ be 
the span of the unique differences, let m’ be its dimension and suppose that 
m!’ <m. As V' is a subspace of V, there is a basis 1, v2,...,¥m of V such 
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that v1, V2,-..,Um is a basis of V’. As x1 = 0, we have 2; € V for all i, 80 we 
can write 
m 
(i 
a ya! Day 
=1 


with a € Q. Note that there exists r such that 2” # 0, as otherwise V 
would be included in the span of the vectors vj, v2,.. +1 Umm1, contradicting 
that dim V =m. . 

Let now 2;(t) = a; + tal”, The crucial fact is the following 


Lemma 12.17. There exists t € R such that 
1) If x;(t) = 2j(t), theni = j. 
2) max 2;(t) — mina;(t) > maxx; — ming. 


Proof. For all i # j, the equation x;(t) = 2;(t) has at most one solution 
(recall that x; 4 «,), so 1) fails for at most finitely many ¢. On the other 
hand, z(t) = 0, so if r is chosen such that a) # 0, then the left-hand side 
of 2) is at least 2, + to” and this becomes arbitrarily large for t >> 0 or 
i << 0. The conclusion follows. oO 


Coming back to the proof, fix such a ¢ and choose j,k such that 2;(¢) = 
max2j(t) and a(t) = mina;(t). We claim that the difference tj — Xp is 
unique and al) x 2), This will yield a contradiction, as ay— a, € V' in 
this case, so its coordinate with respect to um has to be zero. To prove the 
claim, assume that 2; — a2, = 2 — Zp. Looking at the mth coordinate, we 
obtain a) - a = xf”) — 2)” so that xj(t) ~ xy(t) = x(t) — x, (t). But 
then clearly 2;(t) = 2,(t) and z(t) = zy(t), so that by the choice of ¢ we 
must have j = u and k = v. Finally, if a) = a”) , then 


a(t) — ag(t) = 2; ~ zp < maxx; — minay, 


a contradiction. o 
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12.6 Applications to geometry 


In this section we discuss some rather hard problems with geometric flavor. 
Before passing to the first problem, we need to recall a few basic facts from 
convex geometry. Suppose that K is a closed convex subset of R®. The dual 
of K is 


K* = {x € R"|(z, v) > 0,Vu € K}, 


where (-) is the standard inner product on R”. It is easy to see that K* is 
again a closed convex set. Standard but nontrivial separation theorems in 
convex geometry imply the fundamental equality K** = K for any closed and 
convex subset K of R”. Classical examples of closed convex sets are the convex. 
hull of finitely many points and the cone generated by finitely many vectors.? 
Applying the equality K** = K to the cone generated by 0), v2,...,v%, we 
obtain the following beautiful result, which is one of the many versions of 
Farkas’ lemma: 


Theorem 12.18. (Farkas’ lemma) Let v,v1,v2,..., 0% € R” and suppose that 
(w,v) > O whenever min(w,v;) > 0. Then v is in the cone generated by 
UT, U2; .065 UR. 


The following hard problem is a very nice way to re-state Farkas’ lemma. 


29. A figure composed of 1 by 1 squares has the property that if the squares 
of a fixed m by x rectangle are filled with numbers the sum of all of 
which is positive, the figure can be placed on the rectangle so that the 
numbers it covers also have positive sum.® Prove that a number of such 
figures can be placed on the rectangle such that each square is covered 
by the same number of figures. 


Russia 1998 


vx € R", the cone generated by these vectors is the set of linear combinations 
10, + Gaby + +++ + axvK, Where a; are nonnegative real numbers. It is clear that this cone 
is a convex set, but it is not really obvious that it is a closed subset of R”. 

“Possibly after being rotated by a multiple of ad 

* However, the figure may not have any of its squares outside the rectangle. 
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Proof. The first step (and the trickiest...} is to restate the problem in a more 
algebraic way. Suppose that we have N ways to put the figure in the rectangle, 
possibly after being rotated and call these ways 1,2,...,N. To each such way 
7 associate its incidence vector v; € R”", where the jth coordinate of v; is 1 if 
the square j is covered and 0 otherwise (we number the squares of the table 
in some way from 1 to mn). Now, the hypothesis becomes: for any v € R™* 
such that (v,1} > 0, we also have max(v, v;) > 0. Here 1 € R” is the vector 
all of whose coordinates are 1. The result then follows from the following 


Lemma 12.19. Let u,v1,v2,...,un € Q” be vectors, with v nonzero. Suppose 
that for allw € R”, if (w,v) > 0, then max(w, vj) > 0. Then there are rational 
nonnegative nurnbers a1,42,..., an such that v = ayv, + agtg +--+ + anun. 


If we accept this for a moment, we deduce the existence of nonnegative 
integers @1,@2,...,@n and of a. positive integer a such that @-1 = av, +-aqve+ 
+--+ anun. So, if we put a; copies of the ith placement of the figure, each 
square is covered a times and the problem is solved. 

Now, let us turn to the proof of the lemma. By Farkas’ lemma, we can find 
nonnegative real numbers x1, x2,...,¢y such that v = 2yvy+a2202+->-+ayun. 
Forgetting about those indices 4 such that x; = 0, we find a linear system in 
the x;'s (obtained by writing the previous equality coordinate by coordinate), 
whose associated matrix has rational entries and which has positive solutions. 
We claim that such a system also has positive rational solutions. This follows 
from the following standard: 


Lemma 12.20. if a linear system of equations with rational coefficients has 
real solutions, then the rational solutions are dense in the set of real solutions. 


Proof. As the associated matrix has rational entries, if we put it in reduced 
row-echelon form (i.e. perform Gaussian elimination), we immediately deduce 
that the space of solutions has a basis with rational coordinates. The result 
follows from the density of Q in R. Oo 


We're done. 


‘To motivate the following problems, we need to recall a famous result of 
Dehn, also known as Hilbert’s third problem: given two polyhedra in R3 with 
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equal volume, can we always cut the first one into finitely many polyhedrons 
which can be reassembled to yield the second polyhedron? Dehn showed that 
the answer is negative in a rather complicated way, but nowadays there is a 
very beautiful and short proof due to Hadwiger, which we will sketch now. 
We claim that the regular simplex and the cube yield a negative answer to 
Hilbert’s problem. Let a = arccos 3, the measure of the dihedral angles of the 
regular simplex. It is easy to see that 2 is irrational (using the fact that if r 
and cosra € Q, then cosrm € {4}, 41,0}, which was explained in chapter 
9, section 9.2). Let a1,...,a% be the dihedral angles appearing at the edges 
of the pieces of the dissection and let V be the Q-vector space generated by 
them. As © is irrational, there is a linear form f : V + Q such that f() = 0 
and f(a) = 1. If P is a polyhedron appearing in the dissection, define its 
Dehn invariant by H(P) = 3°, lelf(a), the summation being taken over the 
edges of P, |e| being the length of e and a@ the dihedral angle at e. Simple 
geometry shows that this invariant is additive. Since the invariant of the cube 
is 0 and that of the regular simplex is not, it follows that we cannot cut the 
regular simplex into pieces that can be reassembled to form the unit cube. 
The following problem uses a similar argument, which is extremely useful in 
tiling problems. We need one more preliminary result: any vector space has a 
basis and any linearly independent set of vectors extends to a basis. 


30. An a x 6 rectangle is divided into squares with sides parallel to that of 
the rectangle and with side lengths x;, x2,...,%n. Prove that = and = 
are rational numbers, , : 


Dehn’s theorem 


Proof. The first ingredient is the following 


Lemma 12.21. There exists an additive map f : R > R such that f(x) = 0 
af and only if x is a rational multiple of a. 


Proof. Consider a basis B of R as a Q-vector space and such that a € B. 
Choose a bijection g : B -- {a} -+ B (g exists, because B is an infinite set) 
and define f(a) = 0 and f(z) = g(x) for all « € B — {a}. To define f for any 


12.6. Applications to geometry 579 


real number r, express r as a finite combination r = 216; + cebz + +++ + 2nbdn 
of elements 6; € B with rational coefficients x; and define 


f(e) = x1 f (bi) + vo f(be) + +++ + on f (bn). 


We obtain in this way an additive map f and we have f(r) = 0 if and only ifr 
is a rational multiple of a. Indeed, if f(r) = 0 and r = yb; +22bo+-+-+2nbn 
is as above, then 


xy f (bi) + of (be) +--+ en f (bn) = 0. 


Now the elements f(;) are either equal to 0 (if 6; = 2) or equal to g(b,). 
Moreover, the g(b;) are distinct (because g is injective and the b;’s are distinct) 
and are linearly independent (since they are elements of a basis B). The only 
possibility to have the previous relation is x; = 0 whenever b; 4 x, which 
means precisely that r is a rational multiple of a. Oo 


Define the invariant of a rectangle A with side-lengths x, y by 
A(R) = f(z) f(y). 
The following lemma establishes the additivity of this invariant. 


Lemma 12.22. If a rectangle R is partitioned into finitely many rectangles 


_ Ry, Ra, ..., Rg, then H(R) = H(Ry) + H (Rp) +--+ (Rg). 


Proof. This is essentially obvious by additivity of f: any finite partition with 
rectangles can be refined to a partition which forms a grid (just extend all 
sides of all rectangles in the partition until they meet the sides of R), so it 
is enough to check the claim for a grid. This reduces then to proving that 
if R is partitioned into two rectangles Ry, Re by a line parallel to one of its 
sides, then H(R) = H(R,) + H(Rz). But this is clear, by definition of H and 
because f is additive. 


Coming back to the proof and using the lemma, we obtain 


O= flaf(b) = 3° Fai)’, 


t=1 


580 Chapter 12. Higher Algebra in. Combinatorics 


which yields f(«;) = 0 for all i and so % are rational numbers. Since we could 


have worked with b instead of @ from the very beginning, it follows that o: 
i 


are also rational numbers and the theorem is proved. 


Remark 32.23. In [51], the following similar result is proved using Dehn’s 
original method: a rectangle R has at least, one rational sidelength and is tiled 
by smaller rectangles, each having a rational perimeter, Then all sidelengths 
of R and of the tiling rectangles are rational. 


The following stronger result will be used in the next problems. The proof 
is very similar to that of problem 30. 


Theorem 12.24, (Dehn) Let the rectangle Ro be tiled with rectangles 
Ry, Ro,..., Ry. Let r; be the side-ratio of R;. Then rg € Qlri,ra,... sTn). 


Proof. Suppose that rg ¢ L = Q(ri,r2,--.,7n), so we may pick a basis B of R 
as L-vector space, such that the base and height. 60, kg of Ro are in B. Define 
the invariant of a rectangle bx h to be c1(b)eo(h)—cy (h)e2(b), where c) (x), co(x) 
are the coordinates of x with respect to bo, ho € B. As ¢; are L-linear maps, it 
is easy to check that A; has invariant 0. An argument similar to the one used 
in lemma 12.22 shows that the invariant is additive, so the invariant of Ro is 
the sum of the invariants of the R;’s, that is 0. ‘This contradicts the fact that 
Ro has invariant 1. 


We end this chapter with a truly amazing result, whose proof is taken 
from the beautiful paper [34]. The proof requires the following rather technical 
theorem, which we will take for granted, since the proof would take us quite 
far afield. 


Theorem 12.25. (Wall) If r is an algebraic number all of whose conjugates 
have positive real part, then there exist n 2 1 and positive rational numbers 
€1,€2,-.+50n such that 

1 


l=ar+- = 
Of + = 
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31, Let r be a positive real number. Prove the equivalence of the following 
statements: 


1) the unit square can be tiled with finitely many rectangles similar 
to the 1 x r rectangle and having sides parallel to the sides of the 
square. 


2) r is algebraic and all of its conjugates have positive real part. 
Laczkovich-Szekeres theorem 


Proof. Prepare for a long but exciting battle. Recall that the side-ratio of a 
rectangle is the quotient between its horizontal side and its vertical side. 

Step 0. Let us suppose that r is algebraic and all of its conjugates 
have positive real part. Choose ¢,,cz,... as in Wall’s theorem and cut off a 
rectangle of side-ratio cr from a unit square, by a vertical cut. From the 
remaining rectangle, cut off a rectangle of ratio ae by a horizontal cut. The 
remaining rectangle has side-ratio e3r + =i 
the unit square by rectangles with side-ratios cir, ai ear. mate We conclude 
using the fact that the c;’s are rational. This proves one implication. The 
other implication is harder and requires the following steps. 

Step 1. First, we prove. that if there is a partition of a square with 
rectangles whose side-ratio is r or 1/r, then r is algebraic. Let ¢ be any 
real number which is transcendental over Q(r) and scale the vertical axis 
by a factor of x. Applying theorem 12.24 to this new situation, we deduce 
that z € Q(rz,2z/r), so we have an equality xP(ra,z/r) = Q(rz,x/r) for 
P,Q € QLX.Y)]. As a is transcendental over Q(r), the previous equality yields 
XP(rX, X/r) = Q(X, X/r) in Q(r)[X]. Looking at the leading coefficient, 
we easily obtain a nontrivial polynomial with rational coefficients killing 7, 
thus r is algebraic. ; 

Step 2. We claim that if r > 0 is algebraic and has a conjugate with 
real part smaller than or equal to 0, then it has a conjugate with negative real 
part. Indeed, otherwise r has a purely imaginary conjugate and so there is a 
root z of the minimal polynomial p of r such that P(x) = p(—z) = 0. As p 
is irreducible, p must divide p(X) + p(~X) and so pr) + p(—r) = 0. Thus 
~r <0 is a root of p and we are done. 


-. Repeating the process, we tile 
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Step 3. Assume now that r has a conjugate with negative real part. Let 
p be the minimal polynomial of r and let Q be the companion matrix® of p. 
Let B be a basis of R as Q(r)-vector space, chosen such that 1 € B. If x € R, 
its coordinate with respect to 1 € B is of the form ag +a, +++: +an.37"! with 
a; & Q and we let vz be the column vector with coordinates ag, @1,...,@n—1. 
Thus vy; = Quz. The key point is the following: 


Lemma 12.26. There exists a symmetric matric M € M,(R) such that 
1) v'MQv > 0 for all vy ER”. 
2) There exists s € Q(r) such that ui'Mu, <0. 


The proof of this lemma will be given in the next step. Let us see why 
this finishes the proof of the theorem. Define the invariant of a rectangle b x h 
to be upMup. Tn the usual way, we obtain that this invariant is additive. The 
invariant of a rectangle b x br is uM Upp = vi MQu, > 0 and the invariant of a 
rectangle br x 6 is equal to that of a rectangle b x br, as M is symmetric. So 
rectangles of side-ratio r have nonnegative invariant. On the other hand, 2) 
ensures the existence of an s x s square with negative invariant. This square 
cannot be tiled by rectangles with side-ratio r and we are done. 

Step 4. We prove the lemma. As p has no multiple root (because it is 
irreducible), Q is diagonalizable over C. As it has real entries, it is immediate 
to deduce the existence of a block-diagonal matrix D and of P € GL,,(IR) such 
that Q = PDP~1. Moreover, each block of D has side 1 (and corresponds to a 


real eigenvalue of Q) or is of the form (es ) (and corresponds to complex 


conjugate eigenvalues a t:i8 of Q)}. Let T be the diagonal matrix whose entries 
are the real parts of the eigenvalues of Q. We choose M = (P™1)'T'P-!. Then 
for all v we have 


vo! MQu = (P7'v)'TD(P~1v) = ¥_ wai > 0, 


endomorphism of the Q-vector space Q(r). 
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where w; are the coordinates of P~!y and a; are the squares of the real parts 
of the eigenvalues of Q. So the first, property is satisfied. On the other hand, T 
has a negative entry (as r has a conjugate of negative real part by step 2 and 
the conjugates of r are exactly the eigenvalues of Q), which we may assume 
to be the first one. If e; is the column vector whose first coordinate is 1 and 
the other are 0, we deduce that (Pe1)'MPe, < 0. Simply choose v € Q” 
sufficiently close to Pe, to ensure that v'Mv < 0 and then choose s € Q(r 
such that vs = v. The lemma is proved. 


12.7 Notes 


The solutions to some of the problems in this chapter are due to the 
following people: Alon Amit (problem 11), Tom Belulovich (problem 4), Aart 
Blokhuis (problem 27), Iurie Boreico (problem 15), Zeb Brady (problem 21), 
Alexandru Chirvasitu (problem 13), Darij Grinberg (problems 12, 15, 19), 
Omid Hatami (problem 12), Xiangyi Huang (problems 15, 20), Laszlo Lovasz 
(problem 27), James Merryfield (problem 2), Jorge Miranda (problem 14), 
Fedja Nazarov (problems 7, 24, 29), Hunter Spink (problem 8), J. Steinhardt 
(problem 18), Gjergji Zaimi (problems 5, 7, 9, 10, 11, 20). 
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